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<210> 1 

<211> 3599 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> BRCA-l-Associated Protein-1 (BAP-1) 
<400> 1 

gcccgttgtc tgtgtgtggg actgaggggc cccgggggcg gtgggggctc ccggtggggg 60 
cagcggtggg gagggagggc ctggacatgg cgctgagggg ccgccccgcg ggaagatgaa 12 0 
taagggctgg ctggagctgg agagcgaccc aggcctcttc accctgctcg tggaagattt 180 
cggtgtcaag ggggtgcaag tggaggagat ctacgacctt cagagcaaat gtcagggccc 240 
tgtatatgga tttatcttcc tgttcaaatg gatcgaagag cgccggtccc ggcgaaaggt 3 00 
ctctaccttg gtggatgata cgtccgtgat tgatgatgat attgtgaata acatgttctt 360 
tgcccaccag ctgataccca actcttgtgc aactcatgcc ttgctgagcg tgctcctgaa 420 
ctgcagcagc gtggacctgg gacccaccct gagtcgcatg aaggacttca ccaagggttt 480 
cagccctgag agcaaaggat atgcgattgg caatgccccg gagttggcca aggcccataa 54 0 
tagccatgcc aggcccgagc cacgccacct ccctgagaag cagaatggcc ttagtgcagt 600 
gcggaccatg gaggcgttcc actttgtcag ctatgtgcct atcacaggcc ggctctttga 660 
gctggatggg ctgaaggtct accccattga ccatgggccc tggggggagg acgaggagtg 72 0 
gacagacaag gcccggcggg tcatcatgga gcgtatcggc ctcgccactg caggggagcc 780 
ctaccacgac atccgcttca acctgatggc agtggtgccc gaccgcagga tcaagtatga 84 0 
ggccaggctg catgtgctga aggtgaaccg tcagacagta ctagaggctc tgcagcagct 900 
gataagagta acacagccag agctgattca gacccacaag tctcaagagt cacagctgcc 960 
tgaggagtcc aagtcagcca gcaacaagtc cccgctggtg ctggaagcaa acagggcccc 102 0 
tgcagcctct gagggcaacc acacagatgg tgcagaggag gcggctggtt catgcgcaca 1080 
agccccatcc cacagccctc ccaacaaacc caagctagtg gtgaagcctc caggcagcag 1140 
cctcaatggg gttcacccca accccactcc cattgtccag cggctgccgg cctttctaga 1200 
caatcacaat tatgccaagt cccccatgca ggaggaagaa gacctggcgg caggtgtggg 12 60 
ccgcagccga gttccagtcc gcccacccca gcagtactca gatgatgagg atgactatga 1320 
ggatgacgag gaggatgacg tgcagaacac caactctgcc cttaggtata aggggaaggg 13 8 0 
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aacagggaag 
caacaccatc 
tctgtccatc 
gcagccctca 
tttcaactcg 
tgtcacctcc 
ctgcatacgc 
gcacctggct 
ctcgccctcc 
ggtcgtggaa 
gagtggggag 
gattgcaaac 
tgatgaccag 
ggctcaggaa 
aggggtcagc 
cccctacaag 
cgtgtggccc 
tagtcccagc 
tcgtgccctg 
agttggggca 
tggcccttct 
gggctccaac 
cctaggagct 
tcctagctgg 
ccagcctgtc 
ccaaactcag 
gtgagcaggc 
tgtgaactgt 
tagggctctt 
tctgtccctg 
ctccctctca 
tcctgctact 
aagacaaccc 
agcagtgcct 
aaaaaggtga 
caagtgtggc 
tggatcaatg 



ccaggggcat 
aacgtcttgg 
aagactagca 
cccaccccca 
ccactgcgct 
cacatctcca 
tacaaccgtg 
gaggatgggg 
atcagaccaa 
gccacggaca 
aaatactcac 
tatgaggcgt 
agaaggaccc 
ggcatgctgg 
atcggccggc 
gccaagcgcc 
tcaccagggt 
tggagagtcc 
aggcctgaca 
cagcgaggta 
gcctgggcag 
ccaacatgcc 
gtcctggtgg 
gggcctgggt 
cctgttgtaa 
ggacccagca 
tgctgggatc 
cttccttgtt 
cgccttcagt 
gcgcttgagg 
ggggtagcag 
ccagtttcct 
gttggagccc 
ctggagccca 
tccaagcagg 
agctgcctct 
aatgaataaa 



tgagcggttc 
ctgagaagct 
gcggggctgg 
gcaatgagag 
cgcctatccg 
aggtgctttt 
ctgtccgtga 
tgctgagtcc 
tccaaggcag 
gcagagagaa 
ccaaggagct 
gcctcaagga 
acaactacga 
ccaacctagt 
tccacaagca 
agtgaggact 
ccttccctgc 
aggccctggg 
cggcagatca 
ctgcagcttc 
cagaatatat 
accatgttga 
gcccaggtcc 
gggccctggg 
ggaagccagg 
ctgggctggg 
ccatggcctg 
ctagccaggc 
gttgtggccc 
ctcagaagag 
agacagggtt 
cagcctctgc 
ctgtgttcca 
ggccccaaca 
cccctttatc 
gggctgagca 
actctcctaa 



tgctgatggg 
caaagagtcc 
gagtccggct 
tacagacacg 
ctcagccaac 
tggagaggat 
tctgggtcct 
cctggcgctg 
ccaggggtcc 
gacggggatg 
gctggcactg 
ggaggtagag 
tgagttcatc 
ggagcagaac 
gcggaagcct 
gctggccctg 
cccacttccc 
aatgggagga 
gccccatagt 
ctccacagcc 
attttaccta 
cataagttcc 
ttgtatcatg 
ctctgggccc 
tcttctctct 
ttgggagtag 
agcagagcat 
tgttcaagac 
tagctatggg 
cctctgtcca 
gcttatagga 
aaggcactca 
gaggacctga 
cagccccatg 
tgtacatagt 
cagcttgacc 
gaatctcctg 



caactgtcag 
cagaaggacc 
gtggcagtgc 
gcctctgaga 
ccgacgcggc 
gacagcctgc 
gtcatcagca 
acagagggtg 
agcagcccag 
gtgaggcctg 
ctgaagtgtg 
aagaggaaga 
tgcaccttta 
atctccgtgc 
gaccggcgga 
actctgcagc 
cttttcccag 
accaggccac 
gctcaggagg 
ggctgtggag 
tcagagacat 
tacctgacta 
ccacggtccc 
tgctgctcta 
tcattcctct 
ggtgtcccag 
gtgggaactg 
tgctctccat 
cctaaattgg 
gcccctcagt 
agctggcacc 
gggtggggga 
tgccaagggg 
gcctctgcca 
gactgagtgg 
cctctagccc 
agaaaaaaaa 



tgctgcagcc 
tctcaattcc 
ccacacactc 
tcggcagtgc 
cctccagccc 
tgcgtgttga 
caggcctgct 
ggaagggttc 
tggagaagga 
gcgagccctt 
tggaggctga 
agttcaagat 
tctccatgct 
ggcggcgcca 
aacgctctcg 
ccactcttgc 
tattactgaa 
attccttcca 
cagcatctgg 
cagcaggacc 
ctatttttct 
tgctttctct 
aactacaggg 
gccccagcca 
taggagagtg 
tggggttggg 
ttcagtggcc 
agcaaggttc 
gctctaggtc 
attaccatgt 
actcagctct 
cagcaggatc 
taatgggccc 
gatggctttg 
ggggtgctgg 
ctgtaaatac 
aaaaaaaaa 



1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3599 



<210> 2 
<211> 729 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> BRCA-l-Associated Protein-1 (BAP-1) 
<400> 2 

Met Asn Lys Gly Trp Leu Glu Leu Glu Ser Asp Pro Gly Leu Phe Thr 
15 10 15 

Leu Leu Val Glu Asp Phe Gly Val Lys Gly Val Gin Val Glu Glu lie 
20 25 30 

Tyr Asp Leu Gin Ser Lys Cys Gin Gly Pro Val Tyr Gly Phe lie Phe 
35 40 45 

Leu Phe Lys Trp lie Glu Glu Arg Arg Ser Arg Arg Lys Val Ser Thr 
50 55 60 



2 



Leu Val Asp Asp 
65 

Phe Phe Ala His 



Leu Ser Val Leu 
100 

Ser Arg Met Lys 
115 

Tyr Ala lie Gly 
130 

Ala Arg Pro Glu 
145 

Ala Val Arg Thr 



Thr Gly Arg Leu 
180 

His Gly Pro Trp 
195 

Val He Met Glu 
210 

Asp He Arg Phe 
225 

Tyr Glu Ala Arg 



Glu Ala Leu Gin 
260 

Thr His Lys Ser 
275 

Ser Asn Lys Ser 
290 

Ser Glu Gly Asn 
305 

Ala Gin Ala Pro 



Lys Pro Pro Gly 
340 

He Val Gin Arg 
355 

Ser Pro Met Gin 
370 




Thr Ser Val He 
70 

Gin Leu He Pro 
85 

Leu Asn Cys Ser 



Asp Phe Thr Lys 
120 

Asn Ala Pro Glu 
135 

Pro Arg His Leu 
150 

Met Glu Ala Phe 
165 

Phe Glu Leu Asp 



Gly Glu Asp Glu 
200 

Arg He Gly Leu 
215 

Asn Leu Met Ala 
230 

Leu His Val Leu 
245 

Gin Leu He Arg 



Gin Glu Ser Gin 
280 

Pro Leu Val Leu 
295 

His Thr Asp Gly 
310 

Ser His Ser Pro 
325 

Ser Ser Leu Asn 



Leu Pro Ala Phe 
360 

Glu Glu Glu Asp 
375 



Asp Asp Asp He 
75 

Asn Ser Cys Ala 
90 

Ser Val Asp Leu 
105 

Gly Phe Ser Pro 



Leu Ala Lys Ala 
140 

Pro Glu Lys Gin 
155 

His Phe Val Ser 
170 

Gly Leu Lys Val 
185 

Glu Trp Thr Asp 



Ala Thr Ala Gly 
220 

Val Val Pro Asp 
235 

Lys Val Asn Arg 
250 

Val Thr Gin Pro 
265 

Leu Pro Glu Glu 



Glu Ala Asn Arg 
300 

Ala Glu Glu Ala 
315 

Pro Asn Lys Pro 
330 

Gly Val His Pro 
345 

Leu Asp Asn His 



Leu Ala Ala Gly 
380 




Val Asn Asn Met 
80 

Thr His Ala Leu 
95 

Gly Pro Thr Leu 
110 

Glu Ser Lys Gly 
125 

His Asn Ser His 



Asn Gly Leu Ser 
160 

Tyr Val Pro He 
175 

Tyr Pro He Asp 
190 

Lys Ala Arg Arg 
205 

Glu Pro Tyr His 



Arg Arg He Lys 
240 

Gin Thr Val Leu 
255 

Glu Leu He Gin 
270 

Ser Lys Ser Ala 
285 

Ala Pro Ala Ala 



Ala Gly Ser Cys 
320 

Lys Leu Val Val 
335 

Asn Pro Thr Pro 
350 

Asn Tyr Ala Lys 
365 

Val Gly Arg Ser 
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Arg Val Pro Val 
385 

Tyr Glu Asp Asp 



Arg Tyr Lys Gly 
420 

Ala Asp Gly Gin 
435 

Ala Glu Lys Leu 
450 

lie Lys Thr Ser 
465 

His Ser Gin Pro 



Ser Glu lie Gly 
500 

Ser Ala Asn Pro 
515 

Lys Val Leu Phe 
530 

Arg Tyr Asn Arg 
545 

Leu Leu His Leu 



Glu Gly Gly Lys 
580 

Gin Gly Ser Ser 
595 

Ser Arg Glu Lys 
610 

Glu Lys Tyr Ser 
625 

Ala Glu lie Ala 



Arg Lys Lys Phe 
660 

Glu Phe lie Cys 
675 

Ala Asn Leu Val 
690 




Arg Pro Pro Gin 
390 

Glu Glu Asp Asp 
405 

Lys Gly Thr Gly 



Leu Ser Val Leu 
440 

Lys Glu Ser Gin 
455 

Ser Gly Ala Gly 
470 

Ser Pro Thr Pro 
485 

Ser Ala Phe Asn 



Thr Arg Pro Ser 
520 

Gly Glu Asp Asp 
535 

Ala Val Arg Asp 
550 

Ala Glu Asp Gly 
565 

Gly Ser Ser Pro 



Ser Pro Val Glu 
600 

Thr Gly Met Val 
615 

Pro Lys Glu Leu 
630 

Asn Tyr Glu Ala 
645 

Lys lie Asp Asp 



Thr Phe lie Ser 
680 

Glu Gin Asn lie 
695 



Gin Tyr Ser Asp 
395 

Val Gin Asn Thr 
410 

Lys Pro Gly Ala 
425 

Gin Pro Asn Thr 



Lys Asp Leu Ser 
460 

Ser Pro Ala Val 
475 

Ser Asn Glu Ser 
490 

Ser Pro Leu Arg 
505 

Ser Pro Val Thr 



Ser Leu Leu Arg 
540 

Leu Gly Pro Val 
555 

Val Leu Ser Pro 
570 

Ser lie Arg Pro 
585 

Lys Glu Val Val 



Arg Pro Gly Glu 
620 

Leu Ala Leu Leu 
635 

Cys Leu Lys Glu 
650 

Gin Arg Arg Thr 
665 

Met Leu Ala Gin 



Ser Val Arg Arg 
700 




Asp Glu Asp Asp 
400 

Asn Ser Ala Leu 
415 

Leu Ser Gly Ser 
430 

lie Asn Val Leu 
445 

lie Pro Leu Ser 



Ala Val Pro Thr 
480 

Thr Asp Thr Ala 
495 

Ser Pro lie Arg 
510 

Ser His lie Ser 
525 

Val Asp Cys lie 



lie Ser Thr Gly 
560 

Leu Ala Leu Thr 
575 

lie Gin Gly Ser 
590 

Glu Ala Thr Asp 
605 

Pro Leu Ser Gly 



Lys Cys Val Glu 
640 

Glu Val Glu Lys 
655 

His Asn Tyr Asp 
670 

Glu Gly Met Leu 
685 

Arg Gin Gly Val 
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Ser lie Gly Arg Leu His Lys Gin Arg Lys Pro Asp Arg Arg Lys Arg 
705 710 715 720 

Ser Arg Pro Tyr Lys Ala Lys Arg Gin 
725 



<210> 3 

<211> 3828 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> nuclear zinc finger protein 95 (NP95) 



<400> 3 

cgactcctta 

gtccctcccc 

acccacacgg 

caggagctgt 

gaggacggcc 

gtccgccaga 

accgactccg 

gcggccgccg 

gggctgtaca 

gaggcgcagg 

acgtccaggc 

gagaacggcg 

aagtggcagg 

aaggagcggg 

cgggaactct 

ttcgtggacg 

cccatgagac 

tgccgggtct 

tgcgatgagt 

cccagcgagg 

gcgggagagc 

tcacagcggg 

gtcccgtcca 

cgagtccagg 

agcaacgacg 

gggaattttt 

gcggaacagt 

tttgctccca 

gtcagggtgg 

aaccgctacg 

tttctcgtgt 

9 a 9999 aa 99 
gaagccctgg 
caggaggggg 
ggaggtggcc 
ccctacagtc 
ctgtggaatg 
ttgttcctga 
cggcccatca 
cgggcacagg 
caggtgaacc 
cggtgatctc 
catcggcact 
cctaaaaagg 
gaatttatgt 



gagcatggca 
tcagcgccga 
tggactcgct 
tccacgtgga 
ataccctctt 
gcctcgtgct 
gctgctgcct 
agactgacag 
aggtcaatga 
tggtcagggt 
cggcgctgga 
tggtccagat 
acctggaggt 
gcttctggta 
acgccaacgt 
aagtcttcaa 
ggaagagcgg 
gcgcctgcca 
gcgacatggc 
acgagtggta 
ggctgagaga 
actggggcaa 
accactacgg 
tcagcgagtc 
gagcgtactc 
tcacatacac 
cttgtgatca 
tcaatgacca 
tgcgcaatgt 
atggcatcta 
ggcgctacct 
accggatcaa 
ccaaccgaga 
gcttcgcgtc 
cgagcagggc 
tcacggccca 
aggtcctggc 
gtaaagtgga 
cgaccgtgtg 
tgttcagctg 
agcctctgca 
caagcacttc 
gattttgttc 
tttgtcttcc 
attctggcta 



tggctcagag 
caccatgtgg 
gtccaggctg 
gccaggcctg 
cgactacgag 
cccccacagc 
gggccagagt 
caggccagcc 
gtacgtcgat 
gacgcggaag 
ggaggacgtc 
gaactccagg 
gggccaggtg 
cgacgcggag 
ggtgctgggg 
gattgagcgg 
gccgtcctgc 
cctgtgcggg 
cttccacatc 
ctgccctgag 
gagcaagaag 
gggcatggcc 
acccatcccg 
gggtgtccat 
cctagtcctg 
gggtagtggt 
gaaactcacc 
agaaggggcc 
caagggtggc 
caaggttgtg 
tctgcggagg 
gaagctgggg 
gcgagagaag 
ccccaggacg 
cgggtccccg 
gcagagcagc 
gtcactcaag 
ggagacgttc 
ccagcacaac 
ccctgcctgc 
gaccgtcctc 
tcgacaggcg 
ttagtgggct 
ttttttttta 
aaagttggac 



gtgctggtaa 
atccaggttc 
accaaggtgg 
cagaggctgt 
gtccgcctga 
accaaggagc 
gagtcagaca 
gatgaggaca 
gctcgggaca 
gccccctccc 
atttaccacg 
gacgtccgag 
gtcatgctca 
atctccagga 
gatgattctc 
ccgggtgaag 
aagcactgca 
ggccggcagg 
tactgcctgg 
tgccggaatg 
aaggcgaaga 
tgtgtgggcc 
gggatccccg 
cggccccacg 
gcggggggct 
ggtcgagatc 
aacaccaaca 
gaggccaagg 
aagaatagca 
aaatactggc 
gacgatgatg 
ctgaccatgc 
gagaacagca 
ggcaagggca 
cgccggacat 
ctcatcagag 
gaccggccgg 
cagtgtatct 
gtgtgcaagg 
cgctacgacc 
aaccagctct 
ttttgctgaa 
taacttaaac 
tttttatttt 
ttctcagtat 



aactgatggg 
ggaccatgga 
aggagctgag 
tctacagggg 
atgacaccat 
gggactccga 
agtcctccac 
tgtgggatga 
cgaacatggg 
gggacgagcc 
tgaaatacga 
cgcgcgcccg 
actacaaccc 
agcgcgagac 
tgaacgactg 
ggagccccat 
aggacgacgt 
accccgacaa 
acccgcccct 
atgccagcga 
tggcctcggc 
gcaccaagga 
tgggcaccat 
tggctggcat 
atgaggatga 
tttccggcaa 
gggcgctggc 
actggcggtc 
agtacgcccc 
ccgagaaggg 
agcctggccc 
agtatccaga 
agagggagga 
agtggaagcg 
ccaagaaaac 
aggacaagag 
cgagcggcag 
gctgtcagga 
actgcctgga 
tgggccgcag 
tccccggcta 
aacgtgtcgg 
aggtagtgtt 
tcaaatctat 
tgtgtttagt 



ggtttttgct 
cgggaggcag 
gcggaagatc 
caaacagatg 
ccagctcctg 
gctctccgac 
ccacggcgag 
gacggaattg 
ggcgtggttt 
ctgcagctcc 
cgactacccg 
caccatcatc 
cgacaacccc 
caggacggcg 
tcggatcatc 
ggttgacaac 
gaacagactc 
gcagctcatg 
cagcagtgtt 
ggtggtactg 
cacatcgtcc 
atgtaccatc 
gtggcggttc 
acacggccgg 
cgtggaccat 
caagaggacc 
tctcaactgc 
ggggaagccg 
cgctgagggc 
gaagtccggg 
ttggacgaag 
aggctacctg 
ggaggagcag 
gaagtcggca 
caaggtggag 
caacgccaag 
cccgttccag 
gctggtgttc 
cagatccttt 
ctatgccatg 
cggcaatggc 
agggctcgtt 
tcctccgttc 
acattttcag 
tctttgaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 



5 



cataaaagcc 
actacgtggt 
caactcttta 
agcaagcatc 
tggcccgtgg 
aaagaggaaa 
tgcttagcgt 
cacgcagaaa 
tgtccgacga 
gattcgttcc 
gtcaaccaga 
ggaaccgttt 
cttacaagag 
ttttttttgt 
ttgtttttgt 
ttgcagccta 
gagcaatgtt 
agggaagaat 
ttagattctc 



tgcaatttct 
gtggaggctg 
agaaggcgac 
ttcctgacag 
cagcccgtgg 
catctcgggc 
ctgagatccg 
tggcctcaag 
aggcggccac 
ttctttctaa 
ttctagaaac 
gagccttata 
ggtttttttt 
agttactgta 
attttttttc 
tacctcaata 
atttttaaag 
gagacaattt 
agaataaatg 



cgacaaaaca 
ttgatgtttc 
aggatcagtc 
cattttgtca 
catggcgtgg 
ctagttcaaa 
cgtgaaaagt 
gggactctgc 
ggacggacgc 
agacgacagt 
tgcggtcatc 
gatcatttac 
taattttttt 
tatgtaccaa 
ttttgaaagg 
aaacagggat 
ggtttttttc 
tgtgtaggct 
tttttcacag 



acacaagatt 
tggtgtcaag 
cttctctagg 
tctaaagtcc 
ctcagctgtc 
cctttgcctc 
cctctgccca 
tccacgtggg 
cagcacacga 
ctttgttgtt 
cagttcttcc 
attcaatttt 
ttctcttaat 
gaaagatata 
gtttgttaat 
attttaaatc 
acctccttat 
ttttctaaag 
attgaaaaaa 



ttttaaagat 
ttctcagaag 
gttctggccc 
agtgacatgg 
tgttgaagtt 
aaagccatcc 
cgagagcagg 
gccaggcgtg 
agtcacgtgc 
agcactgaat 
tgacaccgga 
tttaactcag 
gaacacattt 
acgttagggt 
ttttctaatt 
acatacctgc 
tcttagatta 
tccagtactt 
aaaaaaaa 



ggaatcagaa 
ttgctgccac 
ccaaggtcag 
ttccccgtgg 
gttgcaagga 
cccaccagac 
gagttggggc 
tgactgacgc 
aagtgccttt 
tattgaaaat 
tgggtgcttg 
caagtgagaa 
tctaaatgaa 
ttggttgttt 
ttaccaaagt 
agacaaactg 
ttaatgtatt 
tgtccagatt 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3828 



<210> 4 
<211> 793 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> nuclear zinc finger protein 95 (NP95) 
<400> 4 

Met Trp lie Gin Val Arg Thr Met Asp Gly Arg Gin Thr His Thr Val 
15 10 15 

Asp Ser Leu Ser Arg Leu Thr Lys Val Glu Glu Leu Arg Arg Lys lie 
20 25 30 

Gin Glu Leu Phe His Val Glu Pro Gly Leu Gin Arg Leu Phe Tyr Arg 
35 40 45 

Gly Lys Gin Met Glu Asp Gly His Thr Leu Phe Asp Tyr Glu Val Arg 
50 55 60 

Leu Asn Asp Thr lie Gin Leu Leu Val Arg Gin Ser Leu Val Leu Pro 
65 70 75 80 

His Ser Thr Lys Glu Arg Asp Ser Glu Leu Ser Asp Thr Asp Ser Gly 
85 90 95 

Cys Cys Leu Gly Gin Ser Glu Ser Asp Lys Ser Ser Thr His Gly Glu 
100 105 110 

' Ala Ala Ala Glu Thr Asp Ser Arg Pro Ala Asp Glu Asp Met Trp Asp 
115 120 125 

Glu Thr Glu Leu Gly Leu Tyr Lys Val Asn Glu Tyr Val Asp Ala Arg 
130 135 140 

Asp Thr Asn Met Gly Ala Trp Phe Glu Ala Gin Val Val Arg Val Thr 
145 150 155 160 
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Arg Lys Ala Pro 



Ala Leu Glu Glu 
180 

Glu Asn Gly Val 
195 

Arg Thr lie lie 
210 

Leu Asn Tyr Asn 
225 

Ala Glu lie Ser 



Ala Asn Val Val 
260 

Phe Val Asp Glu 
275 

Met Val Asp Asn 
290 

Cys Lys Asp Asp 
305 

Cys Gly Gly Arg 



Asp Met Ala Phe 
340 

Pro Ser Glu Asp 
355 

Glu Val Val Leu 
370 

Lys Met Ala Ser 
385 

Met Ala Cys Val 



His Tyr Gly Pro 
420 

Arg Val Gin Val 
435 

lie His Gly Arg 
450 

Gly Tyr Glu Asp 
465 




Ser Arg Asp Glu 
165 

Asp Val lie Tyr 



Val Gin Met Asn 
200 

Lys Trp Gin Asp 
215 

Pro Asp Asn Pro 
230 

Arg Lys Arg Glu 
245 

Leu Gly Asp Asp 



Val Phe Lys lie 
280 

Pro Met Arg Arg 
295 

Val Asn Arg Leu 
310 

Gin Asp Pro Asp 
325 

His lie Tyr Cys 



Glu Trp Tyr Cys 
360 

Ala Gly Glu Arg 
375 

Ala Thr Ser Ser 
390 

Gly Arg Thr Lys 
405 

lie Pro Gly He 



Ser Glu Ser Gly 
440 

Ser Asn Asp Gly 
455 

Asp Val Asp His 
470 



Pro Cys Ser Ser 
170 

His Val Lys Tyr 
185 

Ser Arg Asp Val 



Leu Glu Val Gly 
220 

Lys Glu Arg Gly 
235 

Thr Arg Thr Ala 
250 

Ser Leu Asn Asp 
265 

Glu Arg Pro Gly 



Lys Ser Gly Pro 
300 

Cys Arg Val Cys 
315 

Lys Gin Leu Met 
330 

Leu Asp Pro Pro 
345 

Pro Glu Cys Arg 



Leu Arg Glu Ser 
380 

Ser Gin Arg Asp 
395 

Glu Cys Thr He 
410 

Pro Val Gly Thr 
425 

Val His Arg Pro 



Ala Tyr Ser Leu 
460 

Gly Asn Phe Phe 
475 




Thr Ser Arg Pro 
175 

Asp Asp Tyr Pro 
190 

Arg Ala Arg Ala 
205 

Gin Val Val Met 



Phe Trp Tyr Asp 
240 

Arg Glu Leu Tyr 
255 

Cys Arg He He 
270 

Glu Gly Ser Pro 
285 

Ser Cys Lys His 



Ala Cys His Leu 
320 

Cys Asp Glu Cys 
335 

Leu Ser Ser Val 
350 

Asn Asp Ala Ser 
365 

Lys Lys Lys Ala 



Trp Gly Lys Gly 
400 

Val Pro Ser Asn 
415 

Met Trp Arg Phe 
430 

His Val Ala Gly 
445 

Val Leu Ala Gly 



Thr Tyr Thr Gly 
480 
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Ser Gly Gly Arg Asp Leu Ser Gly Asn Lys Arg Thr Ala Glu Gin Ser 
485 490 495 

Cys Asp Gin Lys Leu Thr Asn Thr Asn Arg Ala Leu Ala Leu Asn Cys 
500 505 510 

Phe Ala Pro lie Asn Asp Gin Glu Gly Ala Glu Ala Lys Asp Trp Arg 
515 520 525 

Ser Gly Lys Pro Val Arg Val Val Arg Asn Val Lys Gly Gly Lys Asn 
530 535 540 

Ser Lys Tyr Ala Pro Ala Glu Gly Asn Arg Tyr Asp Gly lie Tyr Lys 
545 550 555 560 

Val Val Lys Tyr Trp Pro Glu Lys Gly Lys Ser Gly Phe Leu Val Trp 
565 570 575 

Arg Tyr Leu Leu Arg Arg Asp Asp Asp Glu Pro Gly Pro Trp Thr Lys 
580 585 590 

Glu Gly Lys Asp Arg lie Lys Lys Leu Gly Leu Thr Met Gin Tyr Pro 
595 600 605 

Glu Gly Tyr Leu Glu Ala Leu Ala Asn Arg Glu Arg Glu Lys Glu Asn 
610 615 620 

Ser Lys Arg Glu Glu Glu Glu Gin Gin Glu Gly Gly Phe Ala Ser Pro 
625 630 635 640 

Arg Thr Gly Lys Gly Lys Trp Lys Arg Lys Ser Ala Gly Gly Gly Pro 
645 650 655 

Ser Arg Ala Gly Ser Pro Arg Arg Thr Ser Lys Lys Thr Lys Val Glu 
660 665 670 

Pro Tyr Ser Leu Thr Ala Gin Gin Ser Ser Leu lie Arg Glu Asp Lys 
675 680 685 

Ser Asn Ala Lys Leu Trp Asn Glu Val Leu Ala Ser Leu Lys Asp Arg 
690 695 700 

Pro Ala Ser Gly Ser Pro Phe Gin Leu Phe Leu Ser Lys Val Glu Glu 
705 710 715 720 

Thr Phe Gin Cys lie Cys Cys Gin Glu Leu Val Phe Arg Pro lie Thr 
725 730 735 

Thr Val Cys Gin His Asn Val Cys Lys Asp Cys Leu Asp Arg Ser Phe 
740 745 750 

Arg Ala Gin Val Phe Ser Cys Pro Ala Cys Arg Tyr Asp Leu Gly Arg 
755 760 765 

Ser Tyr Ala Met Gin Val Asn Gin Pro Leu Gin Thr Val Leu Asn Gin 
770 775 780 

Leu Phe Pro Gly Tyr Gly Asn Gly Arg 
785 790 
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<210> 5 

<211> 5503 

<212> DNA 

<213> Homo sapiens 




<220> 

<223> Fanconi anemia, complementation group A protein 
( FANCA) (aldehyde dehydrogenase activity) cDNA 



<400> 5 

agccgccgcc ggggctgtag gcgccaaggc catgtccgac tcgtgggtcc cgaactccgc 60 
ctcgggccag gacccagggg gccgccggag ggcctgggcc gagctgctgg cgggaagggt 12 0 
caagagggaa aaatataatc ctgaaagggc acagaaatta aaggaatcag ctgtgcgcct 180 
cctgcgaagc catcaggacc tgaatgccct tttgcttgag gtagaaggtc cactgtgtaa 240 
aaaattgtct ctcagcaaag tgattgactg tgacagttct gaggcctatg ctaatcattc 300 
tagttcattt ataggctctg ctttgcagga tcaagcctca aggctggggg ttcccgtggg 3 60 
tattctctca gccgggatgg ttgcctctag cgtgggacag atctgcacgg ctccagcgga 42 0 
gaccagtcac cctgtgctgc tgactgtgga gcagagaaag aagctgtctt ccctgttaga 480 
gtttgctcag tatttattgg cacacagtat gttctcccgt ctttccttct gtcaagaatt 540 
atggaaaata cagagttctt tgttgcttga agcggtgtgg catcttcacg tacaaggcat 600 
tgtgagcctg caagagctgc tggaaagcca tcccgacatg catgctgtgg gatcgtggct 660 
cttcaggaat ctgtgctgcc tttgtgaaca gatggaagca tcctgccagc atgctgacgt 72 0 
cgccagggcc atgctttctg attttgttca aatgtttgtt ttgaggggat ttcagaaaaa 780 
ctcagatctg agaagaactg tggagcctga aaaaatgccg caggtcacgg ttgatgtact 840 
gcagagaatg ctgatttttg cacttgacgc tttggctgct ggagtacagg aggagtcctc 900 
cactcacaag atcgtgaggt gctggttcgg agtgttcagt ggacacacgc ttggcagtgt 960 
aatttccaca gatcctctga agaggttctt cagtcatacc ctgactcaga tactcactca 1020 
cagccctgtg ctgaaagcat ctgatgctgt tcagatgcag agagagtgga gctttgcgcg 1080 
gacacaccct ctgctcacct cactgtaccg caggctcttt gtgatgctga gtgcagagga 1140 
gttggttggc catttgcaag aagttctgga aacgcaggag gttcactggc agagagtgct 1200 
ctcctttgtg tctgccctgg ttgtctgctt tccagaagcg cagcagctgc ttgaagactg 1260 
ggtggcgcgt ttgatggccc aggcattcga gagctgccag ctggacagca tggtcactgc 132 0 
gttcctggtt gtgcgccagg cagcactgga gggcccctct gcgttcctgt catatgcaga 1380 
ctggttcaag gcctcctttg ggagcacacg aggctaccat ggctgcagca agaaggccct 1440 
ggtcttcctg tttacgttct tgtcagaact cgtgcctttt gagtctcccc ggtacctgca 1500 
ggtgcacatt ctccacccac ccctggttcc cagcaagtac cgctccctcc tcacagacta 1560 
catctcattg gccaagacac ggctggccga cctcaaggtt tctatagaaa acatgggact 1620 
ctacgaggat ttgtcatcag ctggggacat tactgagccc cacagccaag ctcttcagga 1680 
tgttgaaaag gccatcatgg tgtttgagca tacggggaac atcccagtca ccgtcatgga 1740 
ggccagcata ttcaggaggc cttactacgt gtcccacttc ctccccgccc tgctcacacc 1800 
tcgagtgctc cccaaagtcc ctgactcccg tgtggcgttt atagagtctc tgaagagagc 1860 
agataaaatc cccccatctc tgtactccac ctactgccag gcctgctctg ctgctgaaga 1920 
gaagccagaa gatgcagccc tgggagtgag ggcagaaccc aactctgctg aggagcccct 1980 
gggacagctc acagctgcac tgggagagct gagagcctcc atgacagacc ccagccagcg 2 04 0 
tgatgttata tcggcacagg tggcagtgat ttctgaaaga ctgagggctg tcctgggcca 2100 
caatgaggat gacagcagcg ttgagatatc aaagattcag ctcagcatca acacgccgag 2160 
actggagcca cgggaacaca ttgctgtgga cctcctgctg acgtctttct gtcagaacct 2220 
gatggctgcc tccagtgtcg ctcccccgga gaggcagggt ccctgggctg ccctcttcgt 22 80 
gaggaccatg tgtggacgtg tgctccctgc agtgctcacc cggctctgcc agctgctccg 2340 
tcaccagggc ccgagcctga gtgccccaca tgtgctgggg ttggctgccc tggccgtgca 2400 
cctgggtgag tccaggtctg cgctcccaga ggtggatgtg ggtcctcctg cacctggtgc 2460 
tggccttcct gtccctgcgc tctttgacag cctcctgacc tgtaggacga gggattcctt 2520 
gttcttctgc ctgaaatttt gtacagcagc aatttcttac tctctctgca agttttcttc 2580 
ccagtcacga gatactttgt gcagctgctt atctccaggc cttattaaaa agtttcagtt 2640 
cctcatgttc agattgttct cagaggcccg acagcctctt tctgaggagg acgtagccag 2700 
cctttcctgg agacccttgc accttccttc tgcagactgg cagagagctg ccctctctct 2760 
ctggacacac agaaccttcc gagaggtgtt gaaagaggaa gatgttcact taacttacca 2820 
agactggtta cacctggagc tggaaattca acctgaagct gatgctcttt cagatactga 2880 
acggcaggac ttccaccagt gggcgatcca tgagcacttt ctccctgagt cctcggcttc 2 940 
agggggctgt gacggagacc tgcaggctgc gtgtaccatt cttgtcaacg cactgatgga 3000 
tttccaccaa agctcaagga gttatgacca ctcagaaaat tctgatttgg tctttggtgg 3060 
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ccgcacagga 
gcagcaagac 
gattttccgc 
gagacagagg 
ctgcggcagc 
cttggtcaac 
cactgcccac 
gatggtcgac 
tctggtgtgg 
gagcccgctg 
cctctcccct 
gcactttgcg 
ctgcgagaga 
gtcacatctg 
aatcctcgag 
gagtgacctc 
gctgcctttc 
agaggccttc 
tggggataca 
caaccccgtg 
cccgaaaaag 
agaggtgagc 
ggagcctcat 
aatttattac 
tcgactgctt 
cagcgtttgc 
caccaagccc 
cacacgaagg 
ctcactgccg 
ggctcggtgg 
tcagcgggtg 
cacactggtc 
tgagggatgc 
gataagaagg 
gcttggctcc 
gtccacattc 
ggcccctgat 
gggaagacat 
gcctcgttta 
gtgcatttca 
aaaaaaaaaa 



aatgaggata 
ctcatagtgc 
agacggctcc 
gagctgctaa 
agcttccagg 
tctgagatga 
ttcttcaggg 
ttcatactgg 
tggccgagcc 
ccccgggaac 
gaggctgcct 
attcaacaag 
gaggagctat 
acctcaaata 
tgtttagaga 
aggctggggc 
gctttttaca 
ctgcatgttg 
agcacagttt 
gaactgataa 
agcttctcac 
gccgccctcc 
ctcttctgac 
aagcataaca 
tagtggggaa 
gggaaaataa 
gcctccagca 
gtgaggctga 
tcctcaggtg 
cccaggtggt 
caccatggac 
acaggcaaag 
ccgctgcctg 
tgcgaggcca 
cgaatgtcgc 
gtcacagata 
gctccaacct 
ttctgcacat 
ttaagatctt 
ggatggtttt 
aaaaaaaaaa 



ttatttccag 
ctctcggcca 
aggctctgac 
tgtacaaacg 
cagaacagcc 
gaaacttctg 
gcctcctgaa 
ccaagtgcca 
tggagcctgt 
tgcagaagct 
ccccagcacc 
tcagggaaga 
tggttttcct 
gcaccacaga 
agaggaagat 
ggctcctcct 
gtcttctctc 
ctgtggacat 
cacctccagc 
caaaagctcg 
acgtggcaga 
agagcagaca 
gggacctgcc 
tggagctctt 
aggaatcaat 
accactggtc 
ccaagggcgg 
cacagccact 
ggttcgggct 
ggttccgcct 
atgtgtacat 
tccagctcag 
cactggaacc 
cagccctggg 
atttggtgga 
tagttccgca 
cccgggggga 
ggttcaccat 
taaactgctt 
taaagaaacc 
aaaaaaaaaa 



attgcaggag 
caccccttcc 
aagcgggtgg 
gatcctcctc 
catcactgcc 
ctcccacgga 
cgcctgtctg 
gacgaaatgc 
gctgctctgc 
acaagaaggc 
caacccggac 
aaacatcagg 
tttcttcttc 
cctgccaaag 
atcctggctg 
ccgtgtggcc 
ctacttccat 
gtacttgaag 
tggcaggagc 
tctttttctg 
gctgctggct 
gcaggctgcc 
actgcacacc 
gttgcactaa 
tatttatgaa 
ccagagcaga 
gcagcaccct 
gcggagtcca 
tcaccgcctg 
ccaggggcag 
tgaggttgtg 
tctcagcctt 
cacagacctc 
agggggtcct 
cgagaaggtg 
cctctgagag 
cgacgatgac 
gcagtgggcc 
tatacactgt 
tcagaaagct 
aaaaaaaaaa 



atggtagctg 
caggagcact 
agcgtggctg 
cgcctgcctt 
agatgcgagc 
ggtgccctga 
cggagcagag 
cccttaattt 
cggtggagga 
cggcagtttg 
tggctctcag 
aagcagctaa 
tccttgatgg 
gctttccacg 
gcactctttc 
ccggatcagc 
gaagacgcgg 
ctggtccagc 
ctggagctca 
ctgcagttaa 
gatcgtgggg 
cctgacgctg 
agcccagctc 
aaagtggatt 
ctgtccggcc 
ggaaggctac 
ccgaccctcc 
ggctgctaga 
gccctctgtg 
ggccttgtcc 
ggccttctca 
gtgtttggtc 
acacctgggg 
gactcacact 
cttccgctgc 
gggagagtcc 
aatgtgaaac 
caagcaaggg 
cacgtggctt 
atttccttaa 
aaa 



acctggagct 
tcctctttga 
ccagccttca 
cgtctgtcct 
agttcttcca 
cacaggacat 
acccctccct 
tgacctctgc 
gacactgcca 
ccagcgattt 
ctgctgcact 
agaagctgga 
gcctgctgtc 
tttgtgcagc 
agttgacaga 
acaccaggct 
ccatcaggga 
tcttcgtggc 
agggtcaggg 
tacctcggtg 
actgcgaccc 
acctgtccca 
ccgtgtaaat 
acaaatctcc 
ccgagtcact 
ttgagccgga 
catgcgggtg 
ggtgctcatc 
gtcacagagg 
tgggtctgtg 
aaccgccggc 
atgtggtact 
gacagaggca 
tactgcaaag 
ttgaaggttt 
agtgagtcca 
catcacagct 
gcctatgagg 
catcagctgt 
aaaaaaaaaa 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5503 



<210> 6 
<211> 1455 
J <212> PRT 

<213> Homo sapiens 

<220> 

<22 3> Fanconi anemia, complementation group A protein 
(FANCA) (aldehyde dehydrogenase activity) 

<400> 6 

Met Ser Asp Ser Trp Val Pro Asn Ser Ala Ser Gly Gin Asp Pro Gly 
15 10 15 

Gly Arg Arg Arg Ala Trp Ala Glu Leu Leu Ala Gly Arg Val Lys Arg 
20 25 30 

Glu Lys Tyr Asn Pro Glu Arg Ala Gin Lys Leu Lys Glu Ser Ala Val 
35 40 45 
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Arg Leu Leu Arg 
50 

Glu Gly Pro Leu 
65 

Asp Ser Ser Glu 



Ala Leu Gin Asp 
100 

Ser Ala Gly Met 
115 

Ala Glu Thr Ser 
130 

Leu Ser Ser Leu 
145 

Phe Ser Arg Leu 



Leu Leu Leu Glu 
180 

Leu Gin Glu Leu 
195 

Trp Leu Phe Arg 
210 

Cys Gin His Ala 
225 

Met Phe Val Leu 



Val Glu Pro Glu 
260 

Met Leu lie Phe 
275 

Ser Ser Thr His 
290 

His Thr Leu Gly 
305 

Ser His Thr Leu 



Ser Asp Ala Val 
340 

Pro Leu Leu Thr 
355 




Ser His Gin Asp 
55 

Cys Lys Lys Leu 
70 

Ala Tyr Ala Asn 
85 

Gin Ala Ser Arg 



Val Ala Ser Ser 
120 

His Pro Val Leu 
135 

Leu Glu Phe Ala 
150 

Ser Phe Cys Gin 
165 

Ala Val Trp His 



Leu Glu Ser His 
200 

Asn Leu Cys Cys 
215 

Asp Val Ala Arg 
230 

Arg Gly Phe Gin 
245 

Lys Met Pro Gin 



Ala Leu Asp Ala 
280 

Lys lie Val Arg 
295 

Ser Val lie Ser 
310 

Thr Gin lie Leu 
325 

Gin Met Gin Arg 



Ser Leu Tyr Arg 
360 



Leu Asn Ala Leu 
60 

Ser Leu Ser Lys 
75 

His Ser Ser Ser 
90 

Leu Gly Val Pro 
105 

Val Gly Gin He 



Leu Thr Val Glu 
140 

Gin Tyr Leu Leu 
155 

Glu Leu Trp Lys 
170 

Leu His Val Gin 
185 

Pro Asp Met His 



Leu Cys Glu Gin 
220 

Ala Met Leu Ser 
235 

Lys Asn Ser Asp 
250 

Val Thr Val Asp 
265 

Leu Ala Ala Gly 



Cys Trp Phe Gly 
300 

Thr Asp Pro Leu 
315 

Thr His Ser Pro 
330 

Glu Trp Ser Phe 
345 

Arg Leu Phe Val 




Leu Leu Glu Val 



Val He Asp Cys 
80 

Phe He Gly Ser 
95 

Val Gly He Leu 
110 

Cys Thr Ala Pro 
125 

Gin Arg Lys Lys 



Ala His Ser Met 
160 

He Gin Ser Ser 
175 

Gly He Val Ser 
190 

Ala Val Gly Ser 
205 

Met Glu Ala Ser 



Asp Phe Val Gin 
240 

Leu Arg Arg Thr 
255 

Val Leu Gin Arg 
270 

Val Gin Glu Glu 
285 

Val Phe Ser Gly 



Lys Arg Phe Phe 
320 

Val Leu Lys Ala 
335 

Ala Arg Thr His 
350 

Met Leu Ser Ala 
365 
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Glu Glu Leu Val 
370 

His Trp Gin Arg 
385 

Pro Glu Ala Gin 



Gin Ala Phe Glu 
420 

Val Val Arg Gin 
435 

Ala Asp Trp Phe 
450 

Cys Ser Lys Lys 
465 

Val Pro Phe Glu 



Pro Leu Val Pro 
500 

Leu Ala Lys Thr 
515 

Gly Leu Tyr Glu 
530 

Ser Gin Ala Leu 
545 

Thr Gly Asn lie 



Pro Tyr Tyr Val 
580 

Leu Pro Lys Val 
595 

Arg Ala Asp Lys 
610 

Cys Ser Ala Ala 
625 

Ala Glu Pro Asn 



Leu Gly Glu Leu 
660 

lie Ser Ala Gin 
675 



Gly His Leu Gin 
375 

Val Leu Ser Phe 
390 

Gin Leu Leu Glu 
405 

Ser Cys Gin Leu 



Ala Ala Leu Glu 
440 

Lys Ala Ser Phe 
455 

Ala Leu Val Phe 
470 

Ser Pro Arg Tyr 
485 

Ser Lys Tyr Arg 



Arg Leu Ala Asp 
520 

Asp Leu Ser Ser 
535 

Gin Asp Val Glu 
550 

Pro Val Thr Val 
565 

Ser His Phe Leu 



Pro Asp Ser Arg 
600 

lie Pro Pro Ser 
615 

Glu Glu Lys Pro 
630 

Ser Ala Glu Glu 
645 

Arg Ala Ser Met 



Val Ala Val lie 
680 



Glu Val Leu Glu 
380 

Val Ser Ala Leu 
395 

Asp Trp Val Ala 
410 

Asp Ser Met Val 
425 

Gly Pro Ser Ala 



Gly Ser Thr Arg 
460 

Leu Phe Thr Phe 
475 

Leu Gin Val His 
490 

Ser Leu Leu Thr 
505 

Leu Lys Val Ser 



Ala Gly Asp lie 
540 

Lys Ala lie Met 
555 

Met Glu Ala Ser 
570 

Pro Ala Leu Leu 
585 

Val Ala Phe lie 



Leu Tyr Ser Thr 
620 

Glu Asp Ala Ala 
635 

Pro Leu Gly Gin 
650 

Thr Asp Pro Ser 
665 

Ser Glu Arg Leu 



Thr Gin Glu Val 



Val Val Cys Phe 
400 

Arg Leu Met Ala 
415 

Thr Ala Phe Leu 
430 

Phe Leu Ser Tyr 
445 

Gly Tyr His Gly 



Leu Ser Glu Leu 
480 

lie Leu His Pro 
495 

Asp Tyr lie Ser 
510 

lie Glu Asn Met 
525 

Thr Glu Pro His 



Val Phe Glu His 
560 

lie Phe Arg Arg 
575 

Thr Pro Arg Val 
590 

Glu Ser Leu Lys 
605 

Tyr Cys Gin Ala 



Leu Gly Val Arg 
640 

Leu Thr Ala Ala 
655 

Gin Arg Asp Val 
670 

Arg Ala Val Leu 
685 
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Gly His Asn Glu Asp Asp Ser Ser Val Glu lie Ser Lys lie Gin Leu 
690 695 700 

Ser lie Asn Thr Pro Arg Leu Glu Pro Arg Glu His lie Ala Val Asp 
705 710 715 720 

Leu Leu Leu Thr Ser Phe Cys Gin Asn Leu Met Ala Ala Ser Ser Val 
725 730 735 

Ala Pro Pro Glu Arg Gin Gly Pro Trp Ala Ala Leu Phe Val Arg Thr 
740 745 750 

Met Cys Gly Arg Val Leu Pro Ala Val Leu Thr Arg Leu Cys Gin Leu 
755 760 765 

Leu Arg His Gin Gly Pro Ser Leu Ser Ala Pro His Val Leu Gly Leu 
770 775 780 

Ala Ala Leu Ala Val His Leu Gly Glu Ser Arg Ser Ala Leu Pro Glu 
785 790 795 800 

Val Asp Val Gly Pro Pro Ala Pro Gly Ala Gly Leu Pro Val Pro Ala 
805 810 815 

Leu Phe Asp Ser Leu Leu Thr Cys Arg Thr Arg Asp Ser Leu Phe Phe 
820 825 830 

Cys Leu Lys Phe Cys Thr Ala Ala lie Ser Tyr Ser Leu Cys Lys Phe 
835 840 845 

Ser Ser Gin Ser Arg Asp Thr Leu Cys Ser Cys Leu Ser Pro Gly Leu 
850 855 860 

lie Lys Lys Phe Gin Phe Leu Met Phe Arg Leu Phe Ser Glu Ala Arg 
865 870 875 880 

Gin Pro Leu Ser Glu Glu Asp Val Ala Ser Leu Ser Trp Arg Pro Leu 
885 890 895 

His Leu Pro Ser Ala Asp Trp Gin Arg Ala Ala Leu Ser Leu Trp Thr 
900 905 910 

His Arg Thr Phe Arg Glu Val Leu Lys Glu Glu Asp Val His Leu Thr 
915 920 925 

Tyr Gin Asp Trp Leu His Leu Glu Leu Glu lie Gin Pro Glu Ala Asp 
930 935 940 

Ala Leu Ser Asp Thr Glu Arg Gin Asp Phe His Gin Trp Ala lie His 
945 950 955 960 

Glu His Phe Leu Pro Glu Ser Ser Ala Ser Gly Gly Cys Asp Gly Asp 
965 970 975 

Leu Gin Ala Ala Cys Thr lie Leu Val Asn Ala Leu Met Asp Phe His 
980 985 990 

Gin Ser Ser Arg Ser Tyr Asp His Ser Glu Asn Ser Asp Leu Val Phe 
995 1000 1005 
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Gly Gly Arg Thr Gly Asn Glu Asp lie lie Ser Arg Leu Gin Glu Met 
1010 1015 1020 

Val Ala Asp Leu Glu Leu Gin Gin Asp Leu lie Val Pro Leu Gly His 
1025 1030 1035 1040 

Thr Pro Ser Gin Glu His Phe Leu Phe Glu lie Phe Arg Arg Arg Leu 
1045 1050 1055 

Gin Ala Leu Thr Ser Gly Trp Ser Val Ala Ala Ser Leu Gin Arg Gin 
1060 1065 1070 

Arg Glu Leu Leu Met Tyr Lys Arg lie Leu Leu Arg Leu Pro Ser Ser 
1075 1080 1085 

Val Leu Cys Gly Ser Ser Phe Gin Ala Glu Gin Pro lie Thr Ala Arg 
1090 1095 1100 

Cys Glu Gin Phe Phe His Leu Val Asn Ser Glu Met Arg Asn Phe Cys 
1105 1110 1115 1120 

Ser His Gly Gly Ala Leu Thr Gin Asp lie Thr Ala His Phe Phe Arg 
1125 1130 1135 

Gly Leu Leu Asn Ala Cys Leu Arg Ser Arg Asp Pro Ser Leu Met Val 
1140 1145 1150 

Asp Phe lie Leu Ala Lys Cys Gin Thr Lys Cys Pro Leu lie Leu Thr 
1155 1160 1165 

Ser Ala Leu Val Trp Trp Pro Ser Leu Glu Pro Val Leu Leu Cys Arg 
1170 1175 1180 

Trp Arg Arg His Cys Gin Ser Pro Leu Pro Arg Glu Leu Gin Lys Leu 
1185 1190 1195 1200 

Gin Glu Gly Arg Gin Phe Ala Ser Asp Phe Leu Ser Pro Glu Ala Ala 
1205 1210 1215 

Ser Pro Ala Pro Asn Pro Asp Trp Leu Ser Ala Ala Ala Leu His Phe 
1220 1225 1230 

Ala lie Gin Gin Val Arg Glu Glu Asn lie Arg Lys Gin Leu Lys Lys 
1235 1240 1245 

Leu Asp Cys Glu Arg Glu Glu Leu Leu Val Phe Leu Phe Phe Phe Ser 
1250 1255 1260 

Leu Met Gly Leu Leu Ser Ser His Leu Thr Ser Asn Ser Thr Thr Asp 
1265 1270 1275 1280 

Leu Pro Lys Ala Phe His Val Cys Ala Ala lie Leu Glu Cys Leu Glu 
1285 1290 1295 

Lys Arg Lys lie Ser Trp Leu Ala Leu Phe Gin Leu Thr Glu Ser Asp 
1300 1305 1310 

Leu Arg Leu Gly Arg Leu Leu Leu Arg Val Ala Pro Asp Gin His Thr 
1315 1320 1325 
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Arg Leu Leu Pro Phe Ala Phe Tyr Ser Leu Leu Ser Tyr Phe His Glu 
1330 1335 1340 

Asp Ala Ala lie Arg Glu Glu Ala Phe Leu His Val Ala Val Asp Met 
1345 1350 1355 1360 

Tyr Leu Lys Leu Val Gin Leu Phe Val Ala Gly Asp Thr Ser Thr Val 
1365 1370 1375 

Ser Pro Pro Ala Gly Arg Ser Leu Glu Leu Lys Gly Gin Gly Asn Pro 
1380 1385 1390 

Val Glu Leu lie Thr Lys Ala Arg Leu Phe Leu Leu Gin Leu lie Pro 
1395 1400 1405 

Arg Cys Pro Lys Lys Ser Phe Ser His Val Ala Glu Leu Leu Ala Asp 
1410 1415 1420 

Arg Gly Asp Cys Asp Pro Glu Val Ser Ala Ala Leu Gin Ser Arg Gin 
1425 1430 1435 1440 

Gin Ala Ala Pro Asp Ala Asp Leu Ser Gin Glu Pro His Leu Phe 
1445 1450 1455 



<210> 7 
<211> 1378 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> DEAD/H (Asp-Glu-Ala-Asp/His ) box polypeptide 9 

(DDX9) (RNA helicase A, nuclear DNA helicase II, 
leukophysin) , transcript variant 2 cDNA 



<400> 7 

cattgctgct 

tatccatcga 

ccaagcctgg 

caaaagactt 

gattttgatt 

tggaccagat 

aagtacccat 

aaaggcatga 

tctgatgggc 

gctgcctgta 

caacctgcta 

cagatctcta 

gatggtccac 

ggttctagtt 

ggaggctatg 

ggctatagag 

cctagtggag 

ggaactggct 

gttcctgtgt 

tatgtttatt 

ttaaggaaac 

gatctcttaa 

ataacttggt 



gctacctgct 
aattttgctg 
gatgatgcta 
aatatggcta 
aattctgggt 
aataatttgg 
ctcccttctt 
ctttagtcac 
agattgtgct 
tcactggtct 
tcatcagcca 
gaccctcagc 
gtcctcccaa 
acagtggtgg 
gtggcagcgc 
gagtttcccg 
gctacagagg 
actttggaca 
gtagacagta 
tgccaccaaa 
caagcatata 
aaataccaca 
attttcctgg 



ttccagagcc 
gaaacagatt 
gaatgggtgg 
cactaagaat 
ttccagaaga 
atgttgttat 
tgtatttggt 
ccccctgcag 
tgtagatgac 
ccgggcagcc 
gttggacccc 
tgctggtatc 
gatggcccga 
aggctatggc 
caactccttt 

aggtggcttt 

atctggggga 
gggaagagga 
aggaaaaaaa 
aagtaaatgc 
gatgcattag 
gtttgtattt 
ctttcgttta 



tttcatcaat 
ttctgatcac 
agaagaagca 
gacctgggaa 
ttgtttgttg 
ctccctcctg 
gaaaagattc 
ttgcttctct 
tggattaaac 
atggaggctt 
gtaaatgaac 
aaccttatga 
tacgacaatg 
ggtggctata 
cgggcaggat 
agaggcaact 
ttccagcgag 
ggtggcggct 
ggcatgctat 
attttcaccc 
tgattttgtt 
tttctttaag 
atacaataga 



gaaggaaagc 
gtagcccttt 
gagatacgtt 
gccaaagttc 
acacaagtgt 
gcctttgtag 
gaactcgagc 
ttgcctccaa 
tgcaaatatc 
tggttgttga 
gtatgctgaa 
ttggcagtac 
gaagcggata 
gcagtggagg 
atggtgcagg 
ctggaggaga 
gaggtggtag 
attaaaactt 
gtgttacgtg 
attctgtggt 
tatattatgt 
gagtaaagat 
aaataaagta 



ggctgggcta 
tatcagtatt 
tttgtgagca 
agctcaaaga 
ttactaacac 
ccaagacatg 
catctctgct 
gaaagtccaa 
tcatgaagct 
agtaaccaaa 
catgatccgt 
acggtatgga 
tagaagggga 
ctatggtagc 
tgttggtgga 
ctacagaggg 
gggggcctat 
ggttatgtca 
ttttttccag 
tcattgtagt 
aaaatataac 
ttgcctttaa 
ttacaccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1378 
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<210> 8 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> DEAD/H (Asp-Glu-Ala-Asp/His ) box polypeptide 9 

(DDX9) (RNA helicase A, nuclear DNA helicase II, 
leukophysin) , transcript variant 2 

<400> 8 

Met Lys Tyr Pro Ser Pro Phe Phe Val Phe Gly Glu Lys lie Arg Thr 
15 10 15 

Arg Ala lie Ser Ala Lys Gly Met Thr Leu Val Thr Pro Leu Gin Leu 
20 25 30 

Leu Leu Phe Ala Ser Lys Lys Val Gin Ser Asp Gly Gin lie Val Leu 
35 40 45 

Val Asp Asp Trp lie Lys Leu Gin lie Ser His Glu Ala Ala Ala Cys 
50 55 60 

He Thr Gly Leu Arg Ala Ala Met Glu Ala Leu Val Val Glu Val Thr 
65 70 75 80 

Lys Gin Pro Ala He He Ser Gin Leu Asp Pro Val Asn Glu Arg Met 
85 90 95 

Leu Asn Met He Arg Gin He Ser Arg Pro Ser Ala Ala Gly He Asn 
100 105 110 

Leu Met He Gly Ser Thr Arg Tyr Gly Asp Gly Pro Arg Pro Pro Lys 
115 120 125 

Met Ala Arg Tyr Asp Asn Gly Ser Gly Tyr Arg Arg Gly Gly Ser Ser 
130 135 140 

Tyr Ser Gly Gly Gly Tyr Gly Gly Gly Tyr Ser Ser Gly Gly Tyr Gly 
145 150 155 160 

Ser Gly Gly Tyr Gly Gly Ser Ala Asn Ser Phe Arg Ala Gly Tyr Gly 
165 170 175 

Ala Gly Val Gly Gly Gly Tyr Arg Gly Val Ser Arg Gly Gly Phe Arg 
180 185 _ 190 

Gly Asn Ser Gly Gly Asp Tyr Arg Gly Pro Ser Gly Gly Tyr Arg Gly 
195 200 205 

Ser Gly Gly Phe Gin Arg Gly Gly Gly Arg Gly Ala Tyr Gly Thr Gly 
210 215 220 

Tyr Phe Gly Gin Gly Arg Gly Gly Gly Gly Tyr 
225 230 235 



<210> 9 

<211> 4989 

<212> DNA 

<213> Homo sapiens 
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<220> 

<223> insulin-like growth factor 1 receptor (IGF1R) 
(cell surface kinase receptor) cDNA 



<400> 9 

tttttttttt 

ggaggagggt 

tggccgacga 

ctgaagcgcc 

aaggccgagg 

ctgctgttcc 

atccgcggct 

aaggatattg 

aatgctgacc 

aataactaca 

atggaggaga 

tggaccacaa 

gagaacaatg 

acggcctgtg 

cccaacacct 

ctcagcgccg 

gagtgcccct 

ggtccttgcc 

tctgctcaga 

cgggggaata 

ggctacgtga 

cgcctcatcc 

cagaacttgc 

atgtactttg 

acggggacta 

gcctcctgtg 

atcataacct 

tactacaagg 

tccaacagct 

atcttactac 

ctcaccatgg 

accaatgctt 

cagttaatcg 

gtgcgctggc 

gacaaaatcc 

aaccccaaga 

gaagccgaga 

ctgcacaact 

gccaacacca 

accgacccgg 

gagagaactg 

tgcaaccacg 

atgcccgcag 

aactccatct 

gaaataaaat 

aggaagtatg 

caggccacat 

gccaaaacag 

ttgatcgtgg 

aggctgggga 

gtgtacgttc 

gggcaggggt 

cctgaaacca 

gagtttctca 

ctgggtgtgg 

gatctcaaaa 



ttttgagaaa 
ccccgacctc 
gtggagaaat 
tggagaactg 
actaccgcag 
gagtggctgg 
ggaaactctt 
ggctttacaa 
tctgttacct 
ttgtggggaa 
agccgatgtg 
accgctgcca 
agtgctgcca 
tagcttgccg 
acaggtttga 
agagcagcga 
cgggcttcat 
cgaaggtctg 
tgctccaagg 
acattgcttc 
agatccgcca 
taggagagga 
agcaactgtg 
ctttcaatcc 
aagggcgcca 
aaagtgacgt 
ggcaccggta 
aagcaccctt 
ggaacatggt 
atgggctgaa 
tggagaacga 
cagttccttc 
tgaagtggaa 
agcggcagcc 
ccatcaggaa 
ctgaggtgtg 
agcaggccga 
ccatcttcgt 
ccatgtccag 
aagagctgga 
tcatttctaa 
aggctgagaa 
aaggagcaga 
ttttaaagtg 
acggatcaca 
gaggggccaa 
ctctctctgg 
gatatgaaaa 
gagggttggt 
atggagtgct 
ctgatgagtg 
cgtttgggat 
gagtggccat 
acgaagcttc 
tgtcccaagg 
gttatctccg 



gggaatttca 
gctgtggggg 
ctgcgggcca 
cacggtgatc 
ctaccgcttc 
cctcgagagc 
ctacaactac 
cctgaggaac 
ctccactgtg 
taagccccca 
tgagaagacc 
gaaaatgtgc 
ccccgagtgc 
ccactactac 
gggctggcgc 
ctccgagggg 
ccgcaacggc 
tgaggaagaa 
atgcaccatc 
agagctggag 
ttctcatgcc 
gcagctagaa 
ggactgggac 
caaattatgt 
aagcaaaggg 
cctgcatttc 
ccggccccct 
taagaatgtc 
ggacgtggac 
gccctggact 
ccatatccgt 
cattcccttg 
ccctccctct 
tcaggacggc 
gtatgccgac 
tggtggggag 
gaaggaggag 
gcccagacct 
ccgaagcagg 
gacagagtac 
ccttcggcct 
gctgggctgc 
tgacattcct 
gccggaacct 
agttgaggat 
gctaaaccgg 
gaatgggtcg 
cttcatccat 
gattatgctg 
gtatgcctct 
ggaggtggct 
ggtctatgaa 
taaaacagtg 
tgtgatgaag 
ccagccaaca 
gtctctgagg 



tcccaaataa 
ctcctgtttc 
ggcatcgaca 
gagggctacc 
cccaagctca 
ctcggagacc 
gccctggtca 
attactcggg 
gactggtccc 
aaggaatgtg 
accatcaaca 
ccaagcacgt 
ctgggcagct 
tatgccggtg 
tgtgtggacc 
tttgtgatcc 
agccagagca 
aagaaaacaa 
ttcaagggca 
aacttcatgg 
ttggtctcct 
gggaattact 
caccgcaacc 
gtttccgaaa 
gacataaaca 
acctccacca 
gactacaggg 
acagagtatg 
ctcccgccca 
cagtacgccg 
ggggccaaga 
gacgttcttt 
ctgcccaacg 
tacctttacc 
ggcaccatcg 
aaagggcctt 
gctgaatacc 
gaaaggaagc 
aacaccacgg 
cctttctttg 
ttcacattgt 
agcgcctcca 
gggccagtga 
gagaatccca 
cagcgagaat 
ctaaacccgg 
tggacagatc 
ctgatcatcg 
tacgtcttcc 
gtgaacccgg 
cgggagaaga 
ggagttgcca 
aacgaggccg 
gagttcaatt 
ctggtcatca 
ccagaaatgg 



aaggaatgaa 
tctccgccgc 
tccgcaacga 
tccacatcct 
cggtcattac 
tcttccccaa 
tcttcgagat 
gggccatcag 
tgatcctgga 
gggacctgtg 
atgagtacaa 
gtgggaagcg 
gcagcgcgcc 
tctgtgtgcc 
gtgacttctg 
acgacggcga 
tgtactgcat 
agaccattga 
atttgctcat 
ggctcatcga 
tgtccttcct 
ccttctacgt 
tgaccatcaa 
tttaccgcat 
ccaggaacaa 
ccacgtcgaa 
atctcatcag 
atgggcagga 
acaaggacgt 
tttacgtcaa 
gtgagatctt 
cagcatcgaa 
gcaacctgag 
ggcacaatta 
acattgagga 
gctgcgcctg 
gcaaagtctt 
ggagagatgt 
ccgcagacac 
agagcagagt 
accgcatcga 
acttcgtctt 
cctgggagcc 
atggattgat 
gtgtgtccag 
ggaactacac 
ctgtgttctt 
ctctgcccgt 
atagaaagag 
agtacttcag 
tcaccatgag 
agggtgtggt 
caagcatgcg 
gtcaccatgt 
tggaactgat 
agaataatcc 



gtctggctcc 
gctctcgctc 
ctatcagcag 
gctcatctcc 
cgagtacttg 
cctcacggtc 
gaccaatctc 
gattgagaaa 
tgcggtgtcc 
tccagggacc 
ctaccgctgc 
ggcgtgcacc 
tgacaacgac 
tgcctgcccg 
cgccaacatc 
gtgcatgcag 
cccttgtgaa 
ttctgttact 
taacatccga 
ggtggtgacg 
aaaaaacctt 
cctcgacaac 
agcagggaaa 
ggaggaagtg 
cggggagaga 
gaatcgcatc 
cttcaccgtt 
tgcctgcggc 
ggagcccggc 
ggctgtgacc 
gtacattcgc 
ctcctcttct 
ttactacatt 
ctgctccaaa 
ggtcacagag 
ccccaaaact 
tgagaatttc 
catgcaagtg 
ctacaacatc 
ggataacaag 
tatccacagc 
tgcaaggact 
aaggcctgaa 
tctaatgtat 
acaggaatac 
agcccggatt 
ctatgtccag 
cgctgtcctg 
aaataacagc 
cgctgctgat 
ccgggaactt 
gaaagatgaa 
tgagaggatt 
ggtgcgattg 
gacacggggc 
agtcctagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

210 0 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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cctccaagcc 
ctcaacgcca 
gatttcacag 
taccggaaag 
gatggagtct 
gccacactgg 
atggagggcg 
cgcatgtgct 
atcaaagagg 
aagctgcccg 
gacccctcgg 
gagaacggcc 
tacgcccaca 
acctgctgat 
ggtggggggg 
cttcagttct 
atgttccttt 
tgggccttta 
tcctcactct 
ggaaaaataa 
tgtccctgtg 
aaaaacacgt 
ttacaaaggg 
aaatcctgaa 
tgagcatggc 
gactgcccct 
attattattt 
tgaaccggc 



tgagcaagat 
ataagttcgt 
tcaaaatcgg 
gaggcaaagg 
tcaccactta 
ccgagcagcc 
gccttctgga 
ggcagtataa 
agatggagcc 
agccggagga 
cctcctcgtc 
ccggccctgg 
tgaacggggg 
ccttggatcc 
gagagagagt 
gcccttgctg 
tttcaatatg 
agaaccttaa 
gtccctgtcc 
ttgccacaag 
gccccatcca 
ggagatggaa 
ccatcgttca 
ctttctccct 
agctggttgc 
gctgtgctgc 

gggggaactg 



gattcagatg 
ccacagagac 
agattttggt 
gctgctgccc 
ctcggacgtc 
ctaccagggc 
caagccagac 
ccccaagatg 
tggcttccgg 
gctggacctg 
ctccctgcca 
ggtgctggtc 
ccgcaagaac 
tgaatctgtg 
tttaacaatc 
cccgcgggag 
caagcagctt 
tgacaacact 
ttccctgttc 
tccagctggg 
accactgtac 
atttttacct 
tccaaggctg 
catcggcccg 
tccatttgag 
tcaaggccac 
gacacaatag 



gccggagaga 
cttgctgccc 
atgacgcgag 
gtgcgctgga 
tggtccttcg 
ttgtccaacg 
aactgtcctg 
aggccttcct 
gaggtctcct 
gagccagaga 
ctgcccgaca 
ctccgcgcca 
gagcgggcct 
caaacagtaa 
cattcacaag 
acagcttctc 
tttattccct 
taatagcaac 
tccctttctc 
aagccctttt 
acacccgcct 
ttatctttca 
ttaccatttt 
gcgctgattc 
agacacgctg 
aggcacacag 
gtctttctct 



ttgcagacgg 
ggaattgcat 
atatctatga 
tgtctcctga 
gggtcgtcct 
agcaagtcct 
acatgctgtt 
tcctggagat 
tctactacag 
acatggagag 
gacactcagg 
gcttcgacga 
tgccgctgcc 
cgtgtgcgca 
cctcctgtac 
tgcagtaaaa 
gcccaaaccc 
agagcacttg 
tctcctctct 
tatcagtttg 
gacaccgtgg 
cctttctagg 
aacgctgcct 
ctcgtgtccg 
gcgacacact 
gtctcattgc 
cagtgaaggt 



catggcatac 
ggtagccgaa 
gacagactat 
gtccctcaag 
ctgggagatc 
tcgcttcgtc 
tgaactgatg 
catcagcagc 
cgaggagaac 
cgtccccctg 
acacaaggcc 
gagacagcct 
ccagtcttcg 
cgcgcagcgg 
ctcagtggat 
cacatttggg 
ttaactgaca 
agaaccagtc 
gcttcataac 
aggaagtggc 
gtcattacaa 
gacatgaaat 
aattttgcca 
gaggcatggg 
ccgtccatcc 
ttctgactag 
ggggagaagc 



3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 
4680 
4740 
4800 
4860 
4920 
4980 
4989 



<210> 10 
<211> 1367 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> insulin-like growth factor 1 receptor (IGF1R) 
(cell surface kinase receptor) 

<400> 10 

Met Lys Ser Gly Ser Gly Gly Gly Ser Pro Thr Ser Leu Trp Gly Leu 
1 5 10 15 

Leu Phe Leu Ser Ala Ala Leu Ser Leu Trp Pro Thr Ser Gly Glu lie 
20 25 30 

Cys Gly Pro Gly lie Asp lie Arg Asn Asp Tyr Gin Gin Leu Lys Arg 
35 40 45 

Leu Glu Asn Cys Thr Val lie Glu Gly Tyr Leu His lie Leu Leu lie 
50 55 60 

Ser Lys Ala Glu Asp Tyr Arg Ser Tyr Arg Phe Pro Lys Leu Thr Val 
65 70 75 80 

lie Thr Glu Tyr Leu Leu Leu Phe Arg Val Ala Gly Leu Glu Ser Leu 
85 90 95 

Gly Asp Leu Phe Pro Asn Leu Thr Val lie Arg Gly Trp Lys Leu Phe 
100 105 110 
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Tyr Asn Tyr Ala 
115 

Gly Leu Tyr Asn 
130 

Lys Asn Ala Asp 
145 

Leu Asp Ala Val 



Glu Cys Gly Asp 
180 

Glu Lys Thr Thr 
195 

Asn Arg Cys Gin 
210 

Thr Glu Asn Asn 
225 

Ala Pro Asp Asn 



Ala Gly Val Cys 
260 

Gly Trp Arg Cys 
275 

Glu Ser Ser Asp 
290 

Gin Glu Cys Pro 
305 

Cys lie Pro Cys 



Lys Thr Lys Thr 
340 

Cys Thr lie Phe 
355 

Asn lie Ala Ser 
370 

Thr Gly Tyr Val 
385 

Phe Leu Lys Asn 



Asn Tyr Ser Phe 
420 



Leu Val lie Phe 
120 

Leu Arg Asn lie 
135 

Leu Cys Tyr Leu 
150 

Ser Asn Asn Tyr 
165 

Leu Cys Pro Gly 



lie Asn Asn Glu 
200 

Lys Met Cys Pro 
215 

Glu Cys Cys His 
230 

Asp Thr Ala Cys 
245 

Val Pro Ala Cys 



Val Asp Arg Asp 
280 

Ser Glu Gly Phe 
295 

Ser Gly Phe lie 
310 

Glu Gly Pro Cys 
325 

He Asp Ser Val 



Lys Gly Asn Leu 
360 

Glu Leu Glu Asn 
375 

Lys He Arg His 
390 

Leu Arg Leu He 
405 

Tyr Val Leu Asp 



Glu Met Thr Asn 



Thr Arg Gly Ala 
140 

Ser Thr Val Asp 
155 

He Val Gly Asn 
170 

Thr Met Glu Glu 
185 

Tyr Asn Tyr Arg 



Ser Thr Cys Gly 
220 

Pro Glu Cys Leu 
235 

Val Ala Cys Arg 
250 

Pro Pro Asn Thr 
265 

Phe Cys Ala Asn 



Val He His Asp 
300 

Arg Asn Gly Ser 
315 

Pro Lys Val Cys 
330 

Thr Ser Ala Gin 
345 

Leu He Asn He 



Phe Met Gly Leu 
380 

Ser His Ala Leu 
395 

Leu Gly Glu Glu 
410 

Asn Gin Asn Leu 
425 




Leu Lys Asp He 
125 

He Arg He Glu 



Trp Ser Leu He 
160 

Lys Pro Pro Lys 
175 

Lys Pro Met Cys 
190 

Cys Trp Thr Thr 
205 

Lys Arg Ala Cys 



Gly Ser Cys Ser 
240 

His Tyr Tyr Tyr 
255 

Tyr Arg Phe Glu 
270 

He Leu Ser Ala 
285 

Gly Glu Cys Met 



Gin Ser Met Tyr 
320 

Glu Glu Glu Lys 
335 

Met Leu Gin Gly 
350 

Arg Arg Gly Asn 
365 

He Glu Val Val 



Val Ser Leu Ser 
400 

Gin Leu Glu Gly 
415 

Gin Gin Leu Trp 
430 
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Asp Trp Asp His 
435 

Ala Phe Asn Pro 
450 

Val Thr Gly Thr 
465 

Asn Asn Gly Glu 



Ser Thr Thr Thr 
500 

Arg Pro Pro Asp 
515 

Glu Ala Pro Phe 
530 

Gly Ser Asn Ser 
545 

Asp Val Glu Pro 



Tyr Ala Val Tyr 
580 

His lie Arg Gly 
595 

Ser Val Pro Ser 
610 

Ser Gin Leu lie 
625 

Leu Ser Tyr Tyr 



Leu Tyr Arg His 
660 

Tyr Ala Asp Gly 
675 

Thr Glu Val Cys 
690 

Thr Glu Ala Glu 
705 

Val Phe Glu Asn 



Arg Lys Arg Arg 
740 



Arg Asn Leu Thr 
440 

Lys Leu Cys Val 
455 

Lys Gly Arg Gin 
470 

Arg Ala Ser Cys 
485 

Ser Lys Asn Arg 



Tyr Arg Asp Leu 
520 

Lys Asn Val Thr 
535 

Trp Asn Met Val 
550 

Gly lie Leu Leu 
565 

Val Lys Ala Val 



Ala Lys Ser Glu 
600 

lie Pro Leu Asp 
615 

Val Lys Trp Asn 
630 

lie Val Arg Trp 
645 

Asn Tyr Cys Ser 



Thr lie Asp lie 
680 

Gly Gly Glu Lys 
695 

Lys Gin Ala Glu 
710 

Phe Leu His Asn 
725 

Asp Val Met Gin 



lie Lys Ala Gly 



Ser Glu lie Tyr 
460 

Ser Lys Gly Asp 
475 

Glu Ser Asp Val 
490 

He He He Thr 
505 

He Ser Phe Thr 



Glu Tyr Asp Gly 
540 

Asp Val Asp Leu 
555 

His Gly Leu Lys 
570 

Thr Leu Thr Met 
585 

He Leu Tyr He 



Val Leu Ser Ala 
620 

Pro Pro Ser Leu 
635 

Gin Arg Gin Pro 
650 

Lys Asp Lys He 
665 

Glu Glu Val Thr 



Gly Pro Cys Cys 
700 

Lys Glu Glu Ala 
715 

Ser He Phe Val 
730 

Val Ala Asn Thr 
745 




Lys Met Tyr Phe 
445 

Arg Met Glu Glu 



He Asn Thr Arg 
480 

Leu His Phe Thr 
495 

Trp His Arg Tyr 
510 

Val Tyr Tyr Lys 
525 

Gin Asp Ala Cys 



Pro Pro Asn Lys 
560 

Pro Trp Thr Gin 
575 

Val Glu Asn Asp 
590 

Arg Thr Asn Ala 
605 

Ser Asn Ser Ser 



Pro Asn Gly Asn 
640 

Gin Asp Gly Tyr 
655 

Pro He Arg Lys 
670 

Glu Asn Pro Lys 
685 

Ala Cys Pro Lys 



Glu Tyr Arg Lys 
720 

Pro Arg Pro Glu 
735 

Thr Met Ser Ser 
750 
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Arg Ser Arg Asn Thr Thr Ala Ala Asp Thr Tyr Asn lie Thr Asp Pro 
755 760 765 

Glu Glu Leu Glu Thr Glu Tyr Pro Phe Phe Glu Ser Arg Val Asp Asn 
770 775 780 

Lys Glu Arg Thr Val lie Ser Asn Leu Arg Pro Phe Thr Leu Tyr Arg 
785 790 795 800 

lie Asp lie His Ser Cys Asn His Glu Ala Glu Lys Leu Gly Cys Ser 
805 810 815 

Ala Ser Asn Phe Val Phe Ala Arg Thr Met Pro Ala Glu Gly Ala Asp 
820 825 830 

Asp lie Pro Gly Pro Val Thr Trp Glu Pro Arg Pro Glu Asn Ser lie 
835 840 845 

Phe Leu Lys Trp Pro Glu Pro Glu Asn Pro Asn Gly Leu lie Leu Met 
850 855 860 

Tyr Glu lie Lys Tyr Gly Ser Gin Val Glu Asp Gin Arg Glu Cys Val 
865 870 875 880 

Ser Arg Gin Glu Tyr Arg Lys Tyr Gly Gly Ala Lys Leu Asn Arg Leu 
885 890 895 

Asn Pro Gly Asn Tyr Thr Ala Arg lie Gin Ala Thr Ser Leu Ser Gly 
900 905 910 

Asn Gly Ser Trp Thr Asp Pro Val Phe Phe Tyr Val Gin Ala Lys Thr 
915 920 925 

Gly Tyr Glu Asn Phe lie His Leu lie lie Ala Leu Pro Val Ala Val 
930 935 940 

Leu Leu lie Val Gly Gly Leu Val lie Met Leu Tyr Val Phe His Arg 
945 950 955 960 

Lys Arg Asn Asn Ser Arg Leu Gly Asn Gly Val Leu Tyr Ala Ser Val 
965 970 975 

Asn Pro Glu Tyr Phe Ser Ala Ala Asp Val Tyr Val Pro Asp Glu Trp 
980 985 990 

Glu Val Ala Arg Glu Lys lie Thr Met Ser Arg Glu Leu Gly Gin Gly 
995 1000 1005 

Ser Phe Gly Met Val Tyr Glu Gly Val Ala Lys Gly Val Val Lys Asp 
1010 1015 1020 

Glu Pro Glu Thr Arg Val Ala lie Lys Thr Val Asn Glu Ala Ala Ser 
1025 1030 1035 1040 

Met Arg Glu Arg lie Glu Phe Leu Asn Glu Ala Ser Val Met Lys Glu 
1045 1050 1055 

Phe Asn Cys His His Val Val Arg Leu Leu Gly Val Val Ser Gin Gly 
1060 1065 1070 
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Gin Pro Thr Leu Val lie Met Glu Leu Met Thr Arg Gly Asp Leu Lys 
1075 1080 1085 

Ser Tyr Leu Arg Ser Leu Arg Pro Glu Met Glu Asn Asn Pro Val Leu 
1090 1095 1100 

Ala Pro Pro Ser Leu Ser Lys Met lie Gin Met Ala Gly Glu lie Ala 
1105 1110 1115 1120 

Asp Gly Met Ala Tyr Leu Asn Ala Asn Lys Phe Val His Arg Asp Leu 
1125 1130 1135 

Ala Ala Arg Asn Cys Met Val Ala Glu Asp Phe Thr Val Lys lie Gly 
1140 1145 1150 

Asp Phe Gly Met Thr Arg Asp lie Tyr Glu Thr Asp Tyr Tyr Arg Lys 
1155 1160 1165 

Gly Gly Lys Gly Leu Leu Pro Val Arg Trp Met Ser Pro Glu Ser Leu 
1170 1175 1180 

Lys Asp Gly Val Phe Thr Thr Tyr Ser Asp Val Trp Ser Phe Gly Val 
1185 1190 1195 1200 

Val Leu Trp Glu lie Ala Thr Leu Ala Glu Gin Pro Tyr Gin Gly Leu 
1205 1210 1215 

Ser Asn Glu Gin Val Leu Arg Phe Val Met Glu Gly Gly Leu Leu Asp 
1220 1225 1230 

Lys Pro Asp Asn Cys Pro Asp Met Leu Phe Glu Leu Met Arg Met Cys 
1235 1240 1245 

Trp Gin Tyr Asn Pro Lys Met Arg Pro Ser Phe Leu Glu lie lie Ser 
1250 1255 1260 

Ser lie Lys Glu Glu Met Glu Pro Gly Phe Arg Glu Val Ser Phe Tyr 
1265 1270 1275 1280 

Tyr Ser Glu Glu Asn Lys Leu Pro Glu Pro Glu Glu Leu Asp Leu Glu 
1285 1290 1295 

Pro Glu Asn Met Glu Ser Val Pro Leu Asp Pro Ser Ala Ser Ser Ser 
1300 1305 1310 

Ser Leu Pro Leu Pro Asp Arg His Ser Gly His Lys Ala Glu Asn Gly 
1315 1320 1325 

Pro Gly Pro Gly Val Leu Val Leu Arg Ala Ser Phe Asp Glu Arg Gin 
1330 1335 1340 

Pro Tyr Ala His Met Asn Gly Gly Arg Lys Asn Glu Arg Ala Leu Pro 
1345 1350 1355 1360 

Leu Pro Gin Ser Ser Thr Cys 
1365 
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<210> 11 

<211> 2394 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> ubiquitin-conjugating enzyme E2 variant 1 
(UBE2V1) , transcript variant 2 cDNA 



<400> 11 

ttcacacggc 

ctgctaaaca 

ccctgggagt 

aagtggtcgc 

gtcatcctgc 

tgcatcacca 

ggccagaaag 

acacttacaa 

atatacagcc 

tttgtaacaa 

gccatatcag 

cttcggcgcc 

tgttacagca 

ttaatcagtc 

gtaccggaaa 

aattttttgt 

accactgtcc 

cataactggt 

tgctgattac 

cctgcaacaa 

cgcagcctgt 

taagtcttaa 

cccttggccg 

gaattcaggg 

tctccctcct 

ccttgctttg 

agttttggaa 

gccctgccac 

gtgaaaagaa 

gaacttggag 

ttactttcct 

tttggtcatc 

aaccttgaat 

cagggattaa 

cctggcaaat 

atgcataccg 

tagtatttgt 

gtttccatta 

gtgaagttta 

ttttaatata 



acgacttcat 
tggcctacaa 
gcttcgtctt 
acacgtactt 
cacgtaaaca 
caggagtaaa 
gagtaggaga 
gatggacagg 
ttaaaataga 
aaattaatat 
tgctagcaaa 
taatgatgtc 
attaatcaaa 
ttcattttcc 
ggaagctccc 
ccatttgaaa 
acgtagttga 
ggggcacatc 
acggcctggg 
cagccctcta 
gggactactg 
gtgatgcccc 
aagcatagat 
ctttccccat 
cctcaagttc 
gccagaagcc 
ccatactcac 
cctctgctgt 
tagtcaccag 
aaagaccgca 
gtaactgctt 
cttgcaatcc 
ccggtgcatg 
gaaggaaccc 
tgctgcgtct 
aaataaaagc 
gtaaaaccac 
atctttttct 
acataacagt 
ataaaaaaaa 



cgagaccaac 
gttccgcacc 
ctgcctgatc 
tgggctgcca 
ccatcgcatc 
agtccctcgc 
tggcacagtt 
gatgataatt 
atgtggacct 
gaatggagta 
atggcagaat 
taaagaaaat 
aagaaaaacc 
acagtagtaa 
attcaaagga 
tatataagtt 
acttctggga 
taactcaact 
gtctctgcct 
gcctgggggg 
ctaggtgtgt 
ttccaaacca 
tgtaacccct 
atcttctctc 
ctttttgcac 
atcaggtaag 
tcactctcca 
catcagctga 
ggttactcag 
tgaagatact 
ttgcttttaa 
attggggtct 
ccttggtttt 
ggtgtgcaca 
ttccacttgc 
aattcattgt 
cttttgaagc 
ggggggaaaa 
attccataag 
agtgtgcgtt 



ggggacaact 
cacagccctg 
atcttcggca 
cgctgggtca 
caccacgtct 
aatttccgac 
agctggggtc 
gggcctccaa 
aaatacccag 
aatagttcta 
tcatatagca 
atgaaactcc 
acaggccctt 
attttctaga 
aatttatctt 
gtgctataac 
tcaagaaagt 
gtgaaaagac 
tctcccttta 
cttgttagag 
ggggtgtttc 
tcatcctgtc 
ccactcccct 
cccccacctt 
cgtcaccacc 
gttggaaaga 
ccagcctggg 
tgcattgttt 
acctgccagc 
tgtaagcaca 
aaattgaaga 
agtttggaat 
ggtgctgctg 
gcagatcccc 
tgttcaggac 
gtactaaagg 
agcaactatc 
ccttagttct 
cagccttttt 
aataaaaaaa 



gcctggtgac 
aagccctgga 
ccttcaccaa 
ccctcctgca 
caccccacga 
tgttggaaga 
tagaagatga 
gaacaattta 
aagcaccccc 
atggagtggt 
tcaaagttgt 
ctcagccgcc 
ccccttcccc 
tacgtcttgt 
aagatactgt 
aaatcatcct 
ctatttaaat 
acatcacaca 
ccctcccgcc 
tagatgtgaa 
gcctgcaccc 
cccacgctcc 
ctgagattgg 
tatcgagggg 
caacaccttc 
gcctctgacc 
aaatgaatat 
ttagctcagg 
tctcggagtc 
catgatccct 
agttttaaac 
ctgacaactg 
ctgcttccca 
gaaattggtg 
cactaaatgc 
tttttttttt 
aagtctgaaa 
aaggatttaa 
attgtcagac 
aaaaaaaaaa 



actgctgccg 
gcagctatac 
ccagatccac 
ggactggcat 
gacctacttc 
actcgaagaa 
cgaagacatg 
tgaaaaccga 
ctttgtaaga 
ggacccaaga 
cctgcaagag 
cgaaggacag 
ccaattcgat 
agacctcaaa 
aaatgatact 
gtcaagtgta 
tgattcccat 
atcaccttgc 
tcccaccctc 
ggtttcaggt 
ctggttcctt 
tccactcccg 
cttcggtgag 
tgctgctttt 
catgacactt 
tcccttgttt 
tgggtcctca 
ttttgataag 
cttggtggtt 
ctgaattgtt 
agggctttca 
gaacaaaaag 
agatcctcag 
ggcttgacct 
tgaaatgtgg 
ttttttaatt 
agcaattgat 
catcctgtaa 
cattgcctga 
aaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2394 



<210> 12 

<211> 221 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> ubiquitin-conjugating enzyme E2 variant 1 
(UBE2V1) , transcript variant 2 
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<400> 12 

Met Ala Tyr Lys Phe Arg Thr His Ser Pro Glu Ala Leu Glu Gin Leu 
15 10 15 

Tyr Pro Trp Glu Cys Phe Val Phe Cys Leu lie lie Phe Gly Thr Phe 
20 25 30 

Thr Asn Gin lie His Lys Trp Ser His Thr Tyr Phe Gly Leu Pro Arg 
35 40 45 

Trp Val Thr Leu Leu Gin Asp Trp His Val lie Leu Pro Arg Lys His 
50 55 60 

His Arg lie His His Val Ser Pro His Glu Thr Tyr Phe Cys lie Thr 
65 70 75 80 

Thr Gly Val Lys Val Pro Arg Asn Phe Arg Leu Leu Glu Glu Leu Glu 
85 90 95 

Glu Gly Gin Lys Gly Val Gly Asp Gly Thr Val Ser Trp Gly Leu Glu 
100 105 110 

Asp Asp Glu Asp Met Thr Leu Thr Arg Trp Thr Gly Met lie lie Gly 
115 120 125 

Pro Pro Arg Thr lie Tyr Glu Asn Arg lie Tyr Ser Leu Lys lie Glu 
130 135 140 

Cys Gly Pro Lys Tyr Pro Glu Ala Pro Pro Phe Val Arg Phe Val Thr 
145 150 155 160 

Lys lie Asn Met Asn Gly Val Asn Ser Ser Asn Gly Val Val Asp Pro 
165 170 175 

Arg Ala lie Ser Val Leu Ala Lys Trp Gin Asn Ser Tyr Ser lie Lys 
180 185 190 

Val Val Leu Gin Glu Leu Arg Arg Leu Met Met Ser Lys Glu Asn Met 
195 200 205 

Lys Leu Pro Gin Pro Pro Glu Gly Gin Cys Tyr Ser Asn 
210 215 220 



<210> 13 

<211> 1506 

<212> DNA 

<213> Homo sapiens 

<220> 

<2 23> NADP+ dependent aldehyde dehydrogenase 1, soluble 
(ALDH1) cDNA 

<400> 13 

atgtcatcct caggcacgcc agacttacct gtcctactca ccgatttgaa gattcaatat 60 

actaagatct tcataaacaa tgaatggcat gattcagtga gtggcaagaa atttcctgtc 12 0 

tttaatcctg caactgagga ggagctctgc caggtagaag aaggagataa ggaggatgtt 180 

gacaaggcag tgaaggccgc aagacaggct tttcagattg gatctccgtg gcgtactatg 240 

gatgcttccg agagggggcg actattatac aagttggctg atttaatcga aagagatcgt 3 00 

ctgctgctgg cgacaatgga gtcaatgaat ggtggaaaac tctattccaa tgcatatctg 360 

agtgatttag caggctgcat caaaacattg cgctactgtg caggttgggc tgacaagatc 420 
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cagggccgta 
ggtgtatgtg 
gggcctgcac 
actgctctcc 
attgttcctg 
aaagtagcct 
agcaatctga 
gatgccgact 
cagtgttgta 
cgaaggagtg 
actcaaggcc 
gggaagaaag 
tttgtccagc 
atttttggac 
gcaaacaata 
ataacaatct 
agtgcccagt 
tacggtttcc 
tcataa 



caataccaat 
gccaaatcat 
tgagctgtgg 
acgtggcatc 
gttatgggcc 
tcacaggatc 
agagggtgac 
tggacaatgc 
tagccgcatc 
ttgagcgggc 
ctcagattga 
aaggggccaa 
ccacagtgtt 
cagtgcagca 
ctttctatgg 
cctctgctct 
gcccctttgg 
atgaatatac 



tgatggaaat 
tccttggaat 
aaacacagtg 
tttaataaaa 
tacagcaggg 
aacagaggtt 
cctggagctt 
tgttgaattt 
caggattttt 
taagaagtat 
caaggaacaa 
actggaatgt 
ctctaatgtt 
aatcatgaag 
cttatcagca 
gcaggcagga 
cggattcaag 
agaggtcaaa 



ttttttacat 
ttcccgttgg 
gttgtcaaac 
gaggcagggt 
gcagccattt 
ggcaagttga 
ggaggaaaga 
gcacaccatg 
gtggaagaat 
atccttggaa 
tatgataaaa 
ggaggaggcc 
acagatgaga 
tttaaatctt 
ggagtgttta 
acagtgtggg 
atgtctggaa 
acagtcacag 



atacaagaca 
ttatgctcat 
cagcagagca 
ttcctcctgg 
cttctcacat 
tcaaagaagc 
gcccttgcat 
gggtattcta 
caatttatga 
atcctctgac 
tacttgacct 
cgtgggggaa 
tgcgcattgc 
tagatgacgt 
ccaaagacat 
tgaattgcta 
atggaagaga 
tgaaaatctc 



tgaacctatt 
ttggaagata 
aactcctctc 
agtagtgaat 
ggatatagac 
tgccgggaaa 
tgtgttagct 
ccaccagggc 
tgagtttgtt 
cccaggagtc 
cattgagagt 
taaaggctac 
caaagaggag 
gatcaaaaga 
tgataaagcc 
tggcgtggta 
actgggagag 
tcagaagaac 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1506 



<210> 14 
<211> 501 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> NADP+ dependent aldehyde dehydrogenase 1, soluble 
(ALDH1) 

<400> 14 

Met Ser Ser Ser Gly Thr Pro Asp Leu Pro Val Leu Leu Thr Asp Leu 
15 10 15 

Lys lie Gin Tyr Thr Lys lie Phe lie Asn Asn Glu Trp His Asp Ser 
20 25 30 

Val Ser Gly Lys Lys Phe Pro Val Phe Asn Pro Ala Thr Glu Glu Glu 
35 40 45 

Leu Cys Gin Val Glu Glu Gly Asp Lys Glu Asp Val Asp Lys Ala Val 
50 55 60 

Lys Ala Ala Arg Gin Ala Phe Gin lie Gly Ser Pro Trp Arg Thr Met 
65 70 75 80 

Asp Ala Ser Glu Arg Gly Arg Leu Leu Tyr Lys Leu Ala Asp Leu lie 
85 90 95 

Glu Arg Asp Arg Leu Leu Leu Ala Thr Met Glu Ser Met Asn Gly Gly 
100 105 110 

Lys Leu Tyr Ser Asn Ala Tyr Leu Ser Asp Leu Ala Gly Cys lie Lys 
115 120 125 

Thr Leu Arg Tyr Cys Ala Gly Trp Ala Asp Lys lie Gin Gly Arg Thr 
130 135 140 

lie Pro lie Asp Gly Asn Phe Phe Thr Tyr Thr Arg His Glu Pro lie 
145 150 155 160 
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Gly Val Cys Gly Gin lie lie Pro Trp Asn Phe Pro Leu Val Met Leu 
165 170 175 

lie Trp Lys lie Gly Pro Ala Leu Ser Cys Gly Asn Thr Val Val Val 
180 185 190 

Lys Pro Ala Glu Gin Thr Pro Leu Thr Ala Leu His Val Ala Ser Leu 
195 200 205 

lie Lys Glu Ala Gly Phe Pro Pro Gly Val Val Asn He Val Pro Gly 
210 215 220 

Tyr Gly Pro Thr Ala Gly Ala Ala He Ser Ser His Met Asp He Asp 
225 230 235 240 

Lys Val Ala Phe Thr Gly Ser Thr Glu Val Gly Lys Leu He Lys Glu 
245 250 255 

Ala Ala Gly Lys Ser Asn Leu Lys Arg Val Thr Leu Glu Leu Gly Gly 
260 265 270 

Lys Ser Pro Cys He Val Leu Ala Asp Ala Asp Leu Asp Asn Ala Val 
275 280 285 

Glu Phe Ala His His Gly Val Phe Tyr His Gin Gly Gin Cys Cys He 
290 295 300 

Ala Ala Ser Arg He Phe Val Glu Glu Ser He Tyr Asp Glu Phe Val 
305 310 315 320 

Arg Arg Ser Val Glu Arg Ala Lys Lys Tyr He Leu Gly Asn Pro Leu 
325 330 335 

Thr Pro Gly Val Thr Gin Gly Pro Gin He Asp Lys Glu Gin Tyr Asp 
340 345 350 

Lys He Leu Asp Leu He Glu Ser Gly Lys Lys Glu Gly Ala Lys Leu 
355 360 365 

Glu Cys Gly Gly Gly Pro Trp Gly Asn Lys Gly Tyr Phe Val Gin Pro 
370 375 380 

Thr Val Phe Ser Asn Val Thr Asp Glu Met Arg He Ala Lys Glu Glu 
385 390 395 400 

He Phe Gly Pro Val Gin Gin He Met Lys Phe Lys Ser Leu Asp Asp 
405 410 415 

Val He Lys Arg Ala Asn Asn Thr Phe Tyr Gly Leu Ser Ala Gly Val 
420 425 430 

Phe Thr Lys Asp He Asp Lys Ala He Thr He Ser Ser Ala Leu Gin 
435 440 445 

Ala Gly Thr Val Trp Val Asn Cys Tyr Gly Val Val Ser Ala Gin Cys 
450 455 460 

Pro Phe Gly Gly Phe Lys Met Ser Gly Asn Gly Arg Glu Leu Gly Glu 
465 470 475 480 
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Tyr Gly Phe His Glu Tyr Thr Glu Val Lys Thr Val Thr Val Lys lie 
485 490 495 

Ser Gin Lys Asn Ser 
500 



<210> 15 
<211> 2282 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> similar to pyruvate kinase, muscle cDNA 



<400> 15 

ggctgaggca 

ctttgcagcg 

catagtgaag 

acattcctgg 

actggcatca 

attaagtctg 

gcggagacca 

taccggcccg 

aagggcagcg 

gataacgcct 

tgcaaggtgg 

gtgaagcaga 

agcaagaagg 

gacatccagg 

atccgcaagg 

atcaagatta 

gaggccagtg 

aaggtcttcc 

atctgtgcta 

ggcagtgatg 

acagccaaag 

gcagaggctg 

accagcgacc 

agtggggcca 

cgcccacgtg 

ctgtaccgtg 

gacgtggacc 

aagggagatg 

atgcgtgttg 

ccccttcccc 

acgtggcact 

tgtgtttgca 

acccaatcaa 

agggtgacag 

ccagaggact 

agggcactgg 

tagctctcta 

caaacactcc 

gc 



gtggctcctt 
tagcccgagt 
ccgggactgc 
agcacatgtg 
tctgtaccat 
gaatgaatgt 
tcaagaatgt 
ttgctgtggc 
gcactgcaga 
acatggaaaa 
tggaagtggg 
aaggtgccga 
gtgtgaacct 
atctgaagtt 
catctgatgt 
tcagcaaaat 
atgggatcat 
ttgctcagaa 
ctcagatgct 
tggccaatgc 
gggactatcc 
ccatctacca 
ccacagaagc 
taatcgtcct 
cccccatcat 
gcatcttccc 
tccgggtgaa 
tggtcattgt 
ttcctgtgcc 
cagcccatcc 
ggtaggttgg 
gcctgctcta 
gggaagaagg 
cttcctttcc 
cccaaccctg 
gctgttgttc 
ggcctctcca 
accctccacc 



gcacagcagc 
cggtcagcgc 
cttcattcag 
ccgcctggac 
tggcccagct 
ggctcgtctg 
gcgcacagcc 
tctagacact 
ggtggagctg 
gtgtgacgag 
cagcaagatc 
cttcctggtg 
tcctggggct 
tggggtcgag 
ccatgaagtt 
cgagaatcat 
ggtggctcgt 
gatgatgatt 
ggagagcatg 
agtcctggat 
tctggaggct 
cttgcaatta 
caccgccgtg 
caccaagtct 
tgctgtgacc 
tgtgctgtgc 
ctttgccatg 
gctgaccgga 
gtgatggacc 
attaggccag 
gacaccaggg 
gtgggacagc 
aggaatgctg 
tgtgtgtact 
gcttggggtc 
cattgaagcc 
gtttgcacct 
ttccattttc 



tgcacgcgcc 
cggaggacct 
acccagcagc 
attgattcac 
tcccgatcag 
aacttctctc 
acggaaagct 
aaaggacctg 
aagaagggag 
aacatcctgt 
tacgtggatg 
acggaggtgg 
gctgtggact 
caggatgttg 
aggaaggtcc 
gagggggttc 
ggtgatctag 
ggacggtgca 
atcaagaagc 
ggagccgact 
gtgcgcatgc 
tttgaggaac 
ggtgccgtgg 
ggcaggtctg 
cggaatcccc 
aaggacccag 
aatgttggca 
tggcgccctg 
ccagagcccc 
caacgcttgt 
aagaagatca 
ccagagcctg 
gactggaggc 
ctgtccagtt 
aagaaacagc 
gactctggcc 
gtccccaccc 
ccccactact 



gtggctccgg 
cagcagccat 
tgcacgcagc 
cacccatcac 
tggagacgtt 
atggaactca 
ttgcttctga 
agatccgaac 
ccactctcaa 
ggctggacta 
atgggcttat 
aaaatggtgg 
tgcctgctgt 
atatggtgtt 
tgggagagaa 
ggaggtttga 
gcattgagat 
accgagctgg 
cccgccccac 
gcatcatgct 
agcacctgat 
tccgccgcct 
aggcctcctt 
ctcaccaggt 
agacagctcg 
tccaggaggc 
aggcccgagg 
gctccggctt 
tcctccagcc 
agaactcact 
acgcctcact 
gctgcccatc 
ccctggagcc 
cctttagaaa 
cagcaagagt 
ctggccctta 
tccactcagc 
gcagcacctc 



atctcttcgt 
gtcgaagccc 
catggctgac 
agcccggaac 
gaaggagatg 
tgagtaccat 
ccccatcctc 
tgggctcatc 
aatcacgctg 
caagaacatc 
ttctctccag 
ctccttgggc 
gtcggagaag 
tgcgtcattc 
gggaaagaac 
tgaaatcctg 
tcctgcagag 
gaagcctgtc 
tcgggctgaa 
gtctggagaa 
tgcccgtgag 
ggcgcccatt 
caagtgctgc 
ggccagatac 
tcaggcccac 
ctgggctgag 
cttcttcaag 
caccaacacc 
cctgtcccac 
ctgggctgta 
gaaacatggc 
atgtggcccc 
agatggcaag 
aaatggatgc 
taggggcctt 
cttgcttctc 
tgtcctgcag 
caggcctgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2282 



<210> 16 
<211> 531 
<212> PRT 

<213> Homo sapiens 
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<220> 

<223> similar to pyruvate kinase, muscle 
<400> 16 

Met Ser Lys Pro His Ser Glu Ala Gly Thr Ala Phe lie Gin Thr Gin 
15 10 15 

Gin Leu His Ala Ala Met Ala Asp Thr Phe Leu Glu His Met Cys Arg 
20 25 30 

Leu Asp lie Asp Ser Pro Pro lie Thr Ala Arg Asn Thr Gly lie lie 
35 40 45 

Cys Thr lie Gly Pro Ala Ser Arg Ser Val Glu Thr Leu Lys Glu Met 
50 55 60 

lie Lys Ser Gly Met Asn Val Ala Arg Leu Asn Phe Ser His Gly Thr 
65 70 75 80 

His Glu Tyr His Ala Glu Thr lie Lys Asn Val Arg Thr Ala Thr Glu 
85 90 95 

Ser Phe Ala Ser Asp Pro He Leu Tyr Arg Pro Val Ala Val Ala Leu 
100 105 110 

Asp Thr Lys Gly Pro Glu He Arg Thr Gly Leu He Lys Gly Ser Gly 
115 120 125 

Thr Ala Glu Val Glu Leu Lys Lys Gly Ala Thr Leu Lys He Thr Leu 
130 135 140 

Asp Asn Ala Tyr Met Glu Lys Cys Asp Glu Asn He Leu Trp Leu Asp 
145 150 155 160 

Tyr Lys Asn He Cys Lys Val Val Glu Val Gly Ser Lys He Tyr Val 
165 170 175 

Asp Asp Gly Leu He Ser Leu Gin Val Lys Gin Lys Gly Ala Asp Phe 
180 185 190 

Leu Val Thr Glu Val Glu Asn Gly Gly Ser Leu Gly Ser Lys Lys Gly 
195 200 205 

Val Asn Leu Pro Gly Ala Ala Val Asp Leu Pro Ala Val Ser Glu Lys 
210 215 220 

Asp He Gin Asp Leu Lys Phe Gly Val Glu Gin Asp Val Asp Met Val 
225 230 235 240 

Phe Ala Ser Phe He Arg Lys Ala Ser Asp Val His Glu Val Arg Lys 
245 250 255 

Val Leu Gly Glu Lys Gly Lys Asn He Lys He He Ser Lys He Glu 
260 265 270 

Asn His Glu Gly Val Arg Arg Phe Asp Glu He Leu Glu Ala Ser Asp 
275 280 285 

Gly He Met Val Ala Arg Gly Asp Leu Gly He Glu He Pro Ala Glu 
290 295 300 
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• 



Lys Val Phe Leu Ala Gin Lys Met Met lie Gly Arg Cys Asn Arg Ala 
305 310 315 320 

Gly Lys Pro Val lie Cys Ala Thr Gin Met Leu Glu Ser Met lie Lys 
325 330 335 

Lys Pro Arg Pro Thr Arg Ala Glu Gly Ser Asp Val Ala Asn Ala Val 
340 345 350 

Leu Asp Gly Ala Asp Cys lie Met Leu Ser Gly Glu Thr Ala Lys Gly 
355 360 365 

Asp Tyr Pro Leu Glu Ala Val Arg Met Gin His Leu lie Ala Arg Glu 
370 375 380 

Ala Glu Ala Ala lie Tyr His Leu Gin Leu Phe Glu Glu Leu Arg Arg 
385 390 395 400 

Leu Ala Pro lie Thr Ser Asp Pro Thr Glu Ala Thr Ala Val Gly Ala 
405 410 415 

Val Glu Ala Ser Phe Lys Cys Cys Ser Gly Ala lie lie Val Leu Thr 
420 425 430 

Lys Ser Gly Arg Ser Ala His Gin Val Ala Arg Tyr Arg Pro Arg Ala 
435 440 445 

Pro lie lie Ala Val Thr Arg Asn Pro Gin Thr Ala Arg Gin Ala His 
450 455 460 

Leu Tyr Arg Gly lie Phe Pro Val Leu Cys Lys Asp Pro Val Gin Glu 
465 470 475 480 

Ala Trp Ala Glu Asp Val Asp Leu Arg Val Asn Phe Ala Met Asn Val 
485 490 495 

Gly Lys Ala Arg Gly Phe Phe Lys Lys Gly Asp Val Val lie Val Leu 
500 505 510 

Thr Gly Trp Arg Pro Gly Ser Gly Phe Thr Asn Thr Met Arg Val Val 
515 520 525 

Pro Val Pro 
530 



<210> 17 
<211> 2631 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> glucose -6 -phosphate dehydrogenase (G6PD) cDNA 
<400> 17 

agggacagcc cagaggaggc gtggccacgc tgccggcgga agtggagccc tccgcgagcg 6 0 
cgcgaggccg ccggggcagg cggggaaacc ggacagtagg ggcggggccg ggccggcgat 12 0 
ggggatgcgg gagcactacg cggagctgca cccgtgcccg ccggaattgg ggatgcagag 180 
cagcggcagc gggtatggca ggcagccggc gggccggcct ccagcgcagg tgcccgagag 24 0 
gcaggggctg gcctgggatg cgcgcgcacc tgccctcgcc ccgccccgcc cgcacgaggg 3 00 
gtggtggccg aggccccgcc ccgcacgcct cgcctgaggc gggtccgctc agcccaggcg 3 60 
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cccgcccccg 
acacttcggg 
gagcaggtgg 
ggcgatgcct 
ctggccaaga 
gaaaacacct 
cagagtgagc 
gcccgcaact 
agccacatgg 
cccccgaccg 
ggctggaacc 
ctgtccaacc 
ctgggcaagg 
cccatctgga 
actgagggtc 
cacctactgc 
gacgtccgtg 
gtggtcctgg 
ctggacgacc 
tatgtggaga 
aacgagcgca 
cagcagtgca 
aagatgatga 
acctacggca 
gacgtcttct 
cgtattttca 
atttatggca 
tatgagggca 
cacccccgcc 
accattgacc 
gctgctgcta 
tggcccctcc 
ccaacagaag 
tctcactcct 
acccacgtga 
cttgtcacca 
catggccacc 
gctgggaccc 



cccccgccga 
gctgcgagcg 
ccctgagccg 
tccatcagtc 
agaagatcta 
tcatcatggg 
ccttcttcaa 
cctatgtggc 
atgccctcca 
tctacgaggc 
gcatcatcgt 
acatctcctc 
agatggtgca 
accgggacaa 
gcgggggcta 
agatgctgtg 
atgagaaggt 
gccagtacgt 
ccacggtgcc 
atgagaggtg 
aggccgaggt 
agcgcaacga 
ccaagaagcc 
acagatacaa 
gcgggagcca 
ccccactgct 
gccgaggccc 
cctacaagtg 
acggccaccc 
tcagctgcac 
ctacccgagc 
agaccctgcc 
gaaggaggag 
gagtggggcc 
gagaatctgc 
gcaacatctc 
ccgtgccacc 
ctcccaacct 



ttaaatgggc 
cggagggcga 
gacccaggtg 
ggatacacac 
ccccaccatc 
ctatgcccgt 
ggccacccca 
tggccagtac 
cctggggtca 
cgtcaccaag 
ggagaagccc 
cctgttccgt 
gaacctcatg 
catcgcctgc 
tttcgatgaa 
tctggtggcc 
caaggtgttg 
ggggaacccc 
ccgcgggtcc 
ggatggggtg 
gaggctgcag 
gctggtgatc 
gggcatgttc 
gaacgtgaag 
gatgcacttc 
gcaccagatt 
cacggaggca 
ggtgaacccc 
tccttcccgc 
attcctggcc 
ccagctacat 
tgagcccagg 
ggcgcccatt 
agggtgggag 
ctgtggcctt 
gagccccctg 
cgtaggcagc 
caatgccctg 



cggcggggct 
cgacgacgaa 
tgcgggatcc 
atattcatca 
tggtggctgt 
tcccgcctca 
gaggagaagc 
gatgatgcag 
caggccaacc 
aacattcacg 
ttcgggaggg 
gaggaccaga 
gtgctgagat 
gttatcctca 
tttgggatca 
atggagaagc 
aaatgcatct 
gatggagagg 
accaccgcca 
cccttcatcc 
ttccatgatg 
cgcgtgcagc 
ttcaaccccg 
ctccctgacg 
gtgcgcagcg 
gagctggaga 
gacgagctga 
cacaagctct 
cgcccgaccc 
ccgggctctg 
tcctcagctg 
agctgagtca 
cgtctgtccc 
ggagggacaa 
gcccgccagc 
gatgtcccct 
ctctctgcta 
ccattaaatc 



cagcccccgg 
gcgcagacag 
tgcgggaaga 
tcatgggtgc 
tccgggatgg 
cagtggctga 
tcaagctgga 
cctcctacca 
gcctcttcta 
agtcctgcat 
acctgcagag 
tctaccgcat 
ttgccaacag 
ccttcaagga 
tccgggacgt 
ccgcctccac 
cagaggtgca 
gcgaggccac 
cttttgcagc 
tgcgctgcgg 
tggccggcga 
ccaacgaggc 
aggagtcgga 
cctacgagcg 
acgagctccg 
agcccaagcc 
tgaagagagt 
gagccctggg 
cgagtcggga 
gccaccctgg 
ccaagcactc 
cctcctccac 
agagcttatt 
gggggaggaa 
ctcagtgcca 
gtcccaccaa 
taagaaaagc 
cgcaaacagc 



aaacggtcgt 
cgtcatggca 
gcttttccag 
atcgggtgac 
ccttctgccc 
catccgcaaa 
ggacttcttt 
gcgcctcaac 
cctggccttg 
gagccagata 
ctctgaccgg 
cgaccactac 
gatcttcggc 
gccctttggc 
gatgcagaac 
caactcagat 
ggccaacaat 
caaagggtac 
cgtcgtcctc 
caaggccctg 
catcttccac 
cgtgtacacc 
gctggacctg 
cctcatcctg 
tgaggcctgg 
catcccctat 
gggtttccag 
cacccacctc 
ggactccggg 
cccgcccctc 
gagaccatcc 
tcactccagc 
ggccactggg 
aggggcgagc 
cttgacattc 
ctctgcactc 
agacgcagca 
c 



420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2631 



<210> 18 
<211> 515 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> glucose-6-phosphate dehydrogenase (G6PD) 
<400> 18 

Met Ala Glu Gin Val Ala Leu Ser Arg Thr Gin Val Cys Gly lie Leu 
15 10 15 

Arg Glu Glu Leu Phe Gin Gly Asp Ala Phe His Gin Ser Asp Thr His 
20 25 30 

lie Phe lie lie Met Gly Ala Ser Gly Asp Leu Ala Lys Lys Lys lie 
35 40 45 

Tyr Pro Thr lie Trp Trp Leu Phe Arg Asp Gly Leu Leu Pro Glu Asn 
50 55 60 
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Thr Phe lie Met 
65 

Arg Lys Gin Ser 



Lys Leu Glu Asp 
100 

Asp Asp Ala Ala 
115 

His Leu Gly Ser 
130 

Thr Val Tyr Glu 
145 

Gin lie Gly Trp 



Leu Gin Ser Ser 
180 

Glu Asp Gin lie 
195 

Gin Asn Leu Met 
210 

Trp Asn Arg Asp 
225 

Phe Gly Thr Glu 



Arg Asp Val Met 
260 

Met Glu Lys Pro 
275 

Val Lys Val Leu 
290 

Leu Gly Gin Tyr 
305 

Gly Tyr Leu Asp 



Phe Ala Ala Val 
340 

Pro Phe lie Leu 
355 

Val Arg Leu Gin 
370 



Gly Tyr Ala Arg 
70 

Glu Pro Phe Phe 
85 

Phe Phe Ala Arg 



Ser Tyr Gin Arg 
120 

Gin Ala Asn Arg 
135 

Ala Val Thr Lys 
150 

Asn Arg lie lie 
165 

Asp Arg Leu Ser 



Tyr Arg lie Asp 
200 

Val Leu Arg Phe 
215 

Asn lie Ala Cys 
230 

Gly Arg Gly Gly 
245 

Gin Asn His Leu 



Ala Ser Thr Asn 
280 

Lys Cys lie Ser 
295 

Val Gly Asn Pro 
310 

Asp Pro Thr Val 
325 

Val Leu Tyr Val 



Arg Cys Gly Lys 
360 

Phe His Asp Val 
375 



Ser Arg Leu Thr 
75 

Lys Ala Thr Pro 
90 

Asn Ser Tyr Val 
105 

Leu Asn Ser His 



Leu Phe Tyr Leu 
140 

Asn lie His Glu 
155 

Val Glu Lys Pro 
170 

Asn His lie Ser 
185 

His Tyr Leu Gly 



Ala Asn Arg lie 
220 

Val lie Leu Thr 
235 

Tyr Phe Asp Glu 
250 

Leu Gin Met Leu 
265 

Ser Asp Asp Val 



Glu Val Gin Ala 
300 

Asp Gly Glu Gly 
315 

Pro Arg Gly Ser 
330 

Glu Asn Glu Arg 
345 

Ala Leu Asn Glu 



Ala Gly Asp lie 
380 



Val Ala Asp He 
80 

Glu Glu Lys Leu 
95 

Ala Gly Gin Tyr 
110 

Met Asp Ala Leu 
125 

Ala Leu Pro Pro 



Ser Cys Met Ser 
160 

Phe Gly Arg Asp 
175 

Ser Leu Phe Arg 
190 

Lys Glu Met Val 
205 

Phe Gly Pro He 



Phe Lys Glu Pro 
240 

Phe Gly He He 
255 

Cys Leu Val Ala 
270 

Arg Asp Glu Lys 
285 

Asn Asn Val Val 



Glu Ala Thr Lys 
320 

Thr Thr Ala Thr 
335 

Trp Asp Gly Val 
350 

Arg Lys Ala Glu 
365 

Phe His Gin Gin 



f 



Cys Lys Arg Asn 
385 

Tyr Thr Lys Met 



Glu Ser Glu Leu 
420 

Leu Pro Asp Ala 
435 

Gin Met His Phe 
450 

Phe Thr Pro Leu 
465 

Pro Tyr lie Tyr 



Lys Arg Val Gly 
500 

His Lys Leu 
515 



Glu Leu Val He 
390 

Met Thr Lys Lys 
405 

Asp Leu Thr Tyr 



Tyr Glu Arg Leu 
440 

Val Arg Ser Asp 
455 

Leu His Gin He 
470 

Gly Ser Arg Gly 
485 

Phe Gin Tyr Glu 



Arg Val Gin Pro 
395 

Pro Gly Met Phe 
410 

Gly Asn Arg Tyr 
425 

He Leu Asp Val 



Glu Leu Arg Glu 
460 

Glu Leu Glu Lys 
475 

Pro Thr Glu Ala 
490 

Gly Thr Tyr Lys 
505 




Asn Glu Ala Val 
400 

Phe Asn Pro Glu 
415 

Lys Asn Val Lys 
430 

Phe Cys Gly Ser 
445 

Ala Trp Arg He 



Pro Lys Pro He 
480 

Asp Glu Leu Met 
495 

Trp Val Asn Pro 
510 



<210> 19 
<211> 1564 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> HCDR-3, proliferation-associated 2G4 (PA2G4) 
(protease) cDNA 



<400> 19 

ctttcgctcg 

gggagacaga 

tgtcgggcga 

ataagatggg 

caggtgtgtc 

ggaaaatctt 

cggtaaataa 

aggaaggtga 

tagctcacac 

atgttattaa 

atcagaacac 

caatagaagg 

ttatccagaa 

atgaagtata 

gacagagaac 

cttcacgtgc 

gagcatttga 

tgctgcaacc 

ttacagttct 

acctctacaa 

ctgcaagtcg 

ccaccagtgg 

ccagcttgct 



ccctctcctc 
ggcggcggcg 
ggacgagcaa 
gggcgacatc 
ggtactgagc 
caagaaagaa 
ctgtgtatgt 
cttggtaaaa 
ttttgtggtt 
ggcagctcac 
acaagtgaca 
tatgctgtca 
tcccacagac 
tgctgtggat 
cactatttac 
cttcttcagt 
agatgagaag 
atttaatgtt 
gctcatgccc 
gtctgagatg 
aaaaacccag 
ggaaacatta 
gctcctgcct 



gaggatcgag 
gctcagggga 
caggagcaaa 
gccaacaggg 
ctgtgtgaga 
aaggaaatga 
cacttctccc 
attgaccttg 
gatgtagctc 
ctttgtgctg 
gaagcctgga 
caccagttga 
cagcagaaga 
gttctcgtca 
aaacgagacc 
gaggtggaaa 
aaggctcgga 
ctctatgaga 
aatggcccca 
gaggtccagg 
aaaaagaaaa 
gaagaaaatg 
catccccttc 



gggactctga 
aacgaggctg 
ctatcgctga 
tacttcggtc 
aaggtgatgc 
agaaaggtat 
ctttgaagag 
gggtccatgt 
aggggaccca 
aagctgccct 
acaaagttgc 
agcagcatgt 
aggaccatga 
gctcaggaga 
cctctaaaca 
ggcgttttga 
tgggtgtggt 
aggagggtga 
tgcggataac 
atgcagagct 
aaaagaaggc 
aagctgggga 
ccaccaaacc 



ccacagcctg 
cagtggtggt 
ggacctggtc 
cttggtggaa 
catgattatg 
tgcttttccc 
cgaccaggat 
ggatggcttc 
agtaacaggg 
acgcctggtc 
ccactcattt 
catcgatgga 
aaaagctgaa 
gggcaaggcc 
gtatggactg 
tgccatgccg 
ggagtgcgcc 
atttgttgcc 
cagtggtccc 
aaaggccctc 
ctccaagact 
ctgaggtggg 
ccagactctg 



tggctgggaa 
agtaggaaga 
gtgaccaagt 
gcatctagct 
gaagaaacag 
accagcattt 
tatattctca 
atcgctaatg 
aggaaagcag 
aaacctggaa 
aactgcacgc 
gaaaaaacca 
tttgaggtac 
aaggatgcag 
aaaatgaaaa 
tttactttaa 
aaacatgaac 
cagtttaaat 
ttcgagcctg 
ctccagagtt 
gcagagaatg 
tcccatctcc 
tgaagtgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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#1 

ttcttctcca cctaggaccg ccagcagagc 
cccaacccac tcccttccaa caacaaccag 
gcttcccaac cacggaagac tactttaaat 
agtc 




ggggggatct ccctgccccc accccagttc 1440 
ctccaactga ctctggtctt gggaggtgag 1500 
gaaaaaaaga aattgaataa taaaatcagg 1560 

1564 



<210> 20 
<211> 394 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> HCDR-3, proliferation-associated 2G4 (PA2G4) 
(protease) 

<400> 20 

Met Ser Gly Glu Asp Glu Gin Gin Glu Gin Thr lie Ala Glu Asp Leu 
15 10 15 

Val Val Thr Lys Tyr Lys Met Gly Gly Asp lie Ala Asn Arg Val Leu 
20 25 30 

Arg Ser Leu Val Glu Ala Ser Ser Ser Gly Val Ser Val Leu Ser Leu 
35 40 45 

Cys Glu Lys Gly Asp Ala Met lie Met Glu Glu Thr Gly Lys lie Phe 
50 55 60 

Lys Lys Glu Lys Glu Met Lys Lys Gly lie Ala Phe Pro Thr Ser lie 
65 70 75 80 

Ser Val Asn Asn Cys Val Cys His Phe Ser Pro Leu Lys Ser Asp Gin 
85 90 95 

Asp Tyr lie Leu Lys Glu Gly Asp Leu Val Lys lie Asp Leu Gly Val 
100 105 110 

His Val Asp Gly Phe lie Ala Asn Val Ala His Thr Phe Val Val Asp 
115 120 125 

Val Ala Gin Gly Thr Gin Val Thr Gly Arg Lys Ala Asp Val He Lys 
130 135 140 

Ala Ala His Leu Cys Ala Glu Ala Ala Leu Arg Leu Val Lys Pro Gly 
145 150 155 160 

Asn Gin Asn Thr Gin Val Thr Glu Ala Trp Asn Lys Val Ala His Ser 
165 170 175 

Phe Asn Cys Thr Pro He Glu Gly Met Leu Ser His Gin Leu Lys Gin 
180 185 190 

His Val He Asp Gly Glu Lys Thr He He Gin Asn Pro Thr Asp Gin 
195 200 205 

Gin Lys Lys Asp His Glu Lys Ala Glu Phe Glu Val His Glu Val Tyr 
210 215 220 

Ala Val Asp Val Leu Val Ser Ser Gly Glu Gly Lys Ala Lys Asp Ala 
225 230 235 240 
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Gly Gin Arg Thr 



Leu Lys Met Lys 
260 

Phe Asp Ala Met 
275 

Ala Arg Met Gly 
290 

Phe Asn Val Leu 
305 

Phe Thr Val Leu 



Pro Phe Glu Pro 
340 

Glu Leu Lys Ala 
355 

Lys Lys Lys Lys 
370 

Glu Thr Leu Glu 
385 



Thr lie Tyr Lys 
245 

Thr Ser Arg Ala 



Pro Phe Thr Leu 
280 

Val Val Glu Cys 
295 

Tyr Glu Lys Glu 
310 

Leu Met Pro Asn 
325 

Asp Leu Tyr Lys 



Leu Leu Gin Ser 
360 

Lys Ala Ser Lys 
375 

Glu Asn Glu Ala 
390 



Arg Asp Pro Ser 
250 

Phe Phe Ser Glu 
265 

Arg Ala Phe Glu 



Ala Lys His Glu 
300 

Gly Glu Phe Val 
315 

Gly Pro Met Arg 
330 

Ser Glu Met Glu 
345 

Ser Ala Ser Arg 



Thr Ala Glu Asn 
380 

Gly Asp 




Lys Gin Tyr Gly 
255 

Val Glu Arg Arg 
270 

Asp Glu Lys Lys 
285 

Leu Leu Gin Pro 



Ala Gin Phe Lys 
320 

lie Thr Ser Gly 
335 

Val Gin Asp Ala 
350 

Lys Thr Gin Lys 
365 

Ala Thr Ser Gly 



<210> 21 

<211> 3273 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> DEAD/H (Asp-Glu-Ala-Asp/His ) box polypeptide 21 
(DDX21) (RNA helicase II) cDNA 



<400> 21 

gaagaccggt 

tggtttggaa 

gaaagagaaa 

aactgttttc 

gaattctcct 

tcctaaaacc 

taaaaccaaa 

gcccaagaag 

actgaagaat 

agaaagtaac 

taattttccc 

atttcctata 

ggcacggaca 

tggggaactg 

aagagagttg 

ggcttgtttt 

tgatatcctg 

tctcaccaaa 

tgctgatcaa 

ccaaacattg 



cggcctgggc 
tcagacaccg 
aaagagaagc 
cccaaagcta 
aaatccaaaa 
aaaagtttga 
aaagtgacaa 
atgaagaaag 
ggatttcctc 
agtgagatag 
atatctgaag 
caagcaaaga 
ggaactggga 
caagacagga 
gcaaatcaag 
tatggtggaa 
gttggaacac 
cttaagcatg 
gtggaagaga 
cttttttctg 



aacctgcgct 
caatgaaaaa 
caaaatctga 
aacaagttaa 
aggcaaaaaa 
gaaagaaaaa 
aaaatgagga 
aaaaggaaat 
atcctgaacc 
agcaggaaat 
aaactattaa 
cattccatca 
agacattctc 
agagaggccg 
taagcaaaga 
ctccctatgg 
caggtcgtat 
ttgtcctgga 
ttttaagtgt 
caacttgccc 



gaagatgccg 
aggggagaca 
taagactgaa 
aaagaaagca 
gaaagaggag 
ggagcccatt 
gccttctgag 
gaatggagaa 
ggactgtaac 
acctgtggaa 
acttctcaaa 
tgtttacagc 
ctttgccatc 
tgcccctcag 
cttcagtgac 
aggtcaattt 
caaagaccac 
tgaagtggac 
ggcatacaag 
tcattgggta 



ggaaaactcc 
ctgcgaaagc 
gagatagcag 
gagccttctg 
ccatctcaaa 
gaaaagaaag 
gaagaaatag 
actagagaga 
cccagtgaag 
caaaaagaag 
ggccgaggag 
gggaaggact 
cctttgattg 
gtactggttc 
atcacaaaaa 
gaacgcatga 
atacagaatg 
cagatgttgg 
aaagattctg 
tttaatgttg 



gtagtgacgc 
aaaccgagga 
aagaggaaga 
aagttgacat 
atgacatttc 
tggtttcttc 
atgctcctaa 
aaagccccaa 
ctgccagtga 
gcgctttctc 
tgaccttcct 
taattgcaca 
agaaacttca 
ttgcacctac 
agctgtcagt 
ggaatgggat 
gcaaactaga 
atatgggatt 
aagacaatcc 
ccaagaaata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



34 



#1 



catgaaatct 
aactgtggag 
tgtcatccga 
agaagcccag 
agacattcca 
agttttggtg 
tatacaaagc 
gcagagctgg 
tagtacaagt 
cagaaataat 
ctgccattag 
aagctctggc 
tgatcaactc 
atattagtta 
tgaagggaat 
cagtaacaga 
cagagcaacc 
aaggcagtcg 
aaggcagtag 
aaaacaaagg 
atttatatag 
caaagttaaa 
acacagacaa 
tacttcttca 
tgtattatct 
cacctgtctt 
agggagtgta 
ccacctgccg 
gcaatgtttt 
atgctctaga 
ttgatacatt 
tagtatatat 
aaggtggctc 
tactccactt 
gggggcgcag 



acatatgaac 
catctggcta 
gtatatagtg 
gagctgtccc 
cagaagcaaa 
gcaaccaatg 
tctccaccaa 
aaggacgggg 
ggagcaaaaa 
aaaagcttcc 
tcacttcaaa 
agcagcactg 
aaatgtgggt 
tgcttggaaa 
ggtttttctc 
aatacaggag 
agaactggaa 
aggcttcagg 
aggcccgaga 
ccagaagcgg 
caaaaagaga 
agcacattgt 
gcttcattta 
tcagtttttc 
gtagattaga 
tatactcaaa 
acagtctctg 
aacagtttct 
gaaacaagat 
cttggaagat 
tttagcttct 
aaaagtacat 
catagcttta 
cttcctattg 
attagcattg 



aggtggacct 
ttaagtgcca 
gtcatcaagg 
agaattcagc 
gggaaatcac 
ttgctgcacg 
agggatgtag 
gtgtgcatct 
gcgggaatta 
agcaaagatg 
caatcagctg 
gcccatattt 
tttgtgacca 
gaacttaaag 
aaaggaaagc 
aaatggcatg 
ggaccacggg 
ggacagcggg 
ggacagcgat 
agtttcagta 
atgatgtttg 
gcctcctttt 
aattatttca 
cttttgaaag 
agataaaatc 
agtgtccctt 
gaggaccact 
catgtggtcc 
ttcaaactaa 
gtagtatgtt 
cattataagg 
tttaatagaa 
tttgtaagta 
gaagattaac 
ctcaagagta 



gattggtaaa 
ctggactcag 
acgcactatc 
tataaagcag 
cctgaaaggt 
tgggttagac 
agtcctacat 
gcttttatca 
agttcaaacg 
ccatcaggct 
agaagctgat 
caggtgccac 
tgatcttgca 
agcagctggg 
tgggtgtttg 
attcacgacg 
aaggatatgg 
acggaaacag 
caggaggtgg 
aagcatttgg 
gcaatataga 
gaccacttgc 
tctgatcatt 
gtgtatgaat 
aagcatgtat 
aatagtgtcc 
ttgagccttt 
tattatttgt 
tctgggttgt 
tgatgtggat 
tgattcatgc 
agccagggtt 
ggctggataa 
attatttacc 
tgt 



aagactcaga 
agggcagcag 
atcttttgtg 
gatgctcagt 
tttagaaatg 
atccctgagg 
tcatcgatcc 
gcacaaggaa 
aataggtgtt 
tttggattcc 
agaggagaag 
gtccgtagac 
gtgctcaatt 
cgaggagatt 
ctttgatgta 
ctggcagctc 
aggcttcagg 
aagattcaga 
caacaaaagt 
tcaataatta 
actgaacatt 
caagtccctg 
atcatttata 
tcattacttt 
ctgcctatac 
ttccctgaaa 
ggaagttaag 
ctactgagac 
aatacagttt 
tacctatact 
tttagtgaat 
ttaaggaatt 
atggtgctta 
aagaaggact 



aaacggcaat 
ttattgggga 
aaaccaagaa 
ccttgcatgg 
gtagttttgg 
ttgatttggt 
gggcggacag 
gaatatcagt 
ccttctgcaa 
gtgcctccca 
ggagctgtgg 
cagcgctcct 
gaaatgccaa 
gattccaaag 
cctaccgcat 
tctgtggcca 
ggacagcggg 
ggacagcggg 
aacagatccc 
gaaatagaag 
atttttcatg 
tctctttcag 
actttattgt 
tttattctaa 
tttgtgagtt 
taaataccta 
gtttcctcag 
ttaatactga 
ataccagtgt 
tatgttcgtt 
tcttcataga 
tcacatgtat 
aatggtaatg 
taagggagta 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3273 



<210> 22 
<211> 558 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 21 
(DDX21) (RNA helicase II) 

<400> 22 

Met Pro Gly Lys Leu Arg Ser Asp Ala Gly Leu Glu Ser Asp Thr Ala 
15 10 15 

Met Lys Lys Gly Glu Thr Leu Arg Lys Gin Thr Glu Glu Lys Glu Lys 
20 25 30 

Lys Glu Lys Pro Lys Ser Asp Lys Thr Glu Glu lie Ala Glu Glu Glu 
35 40 45 

Glu Thr Val Phe Pro Lys Ala Lys Gin Val Lys Lys Lys Ala Glu Pro 
50 55 60 

Ser Glu Val Asp Met Asn Ser Pro Lys Ser Lys Lys Ala Lys Lys Lys 
65 70 75 80 



35 



Glu Glu Pro Ser 



Lys Lys Lys Glu 
100 

Lys Val Thr Lys 
115 

Lys Pro Lys Lys 
130 

Glu Lys Ser Pro 
145 

Cys Asn Pro Ser 



Gin Glu He Pro 
180 

He Ser Glu Glu 
195 

Leu Phe Pro He 
210 

Asp Leu He Ala 
225 

Ala He Pro Leu 



Arg Gly Arg Ala 
260 

Ala Asn Gin Val 
275 

Val Ala Cys Phe 
290 

Met Arg Asn Gly 
305 

Asp His He Gin 



Val Leu Asp Glu 
340 

Val Glu Glu He 
355 

Pro Gin Thr Leu 
370 

Val Ala Lys Lys 
385 



* 

Gin Asn Asp He 
85 

Pro He Glu Lys 



Asn Glu Glu Pro 
120 

Met Lys Lys Glu 
135 

Lys Leu Lys Asn 
150 

Glu Ala Ala Ser 
165 

Val Glu Gin Lys 



Thr He Lys Leu 
200 

Gin Ala Lys Thr 
215 

Gin Ala Arg Thr 
230 

He Glu Lys Leu 
245 

Pro Gin Val Leu 



Ser Lys Asp Phe 
280 

Tyr Gly Gly Thr 
295 

He Asp He Leu 
310 

Asn Gly Lys Leu 
325 

Val Asp Gin Met 



Leu Ser Val Ala 
360 

Leu Phe Ser Ala 
375 

Tyr Met Lys Ser 
390 



Ser Pro Lys Thr 
90 

Lys Val Val Ser 
105 

Ser Glu Glu Glu 



Lys Glu Met Asn 
140 

Gly Phe Pro His 
155 

Glu Glu Ser Asn 
170 

Glu Gly Ala Phe 
185 

Leu Lys Gly Arg 



Phe His His Val 
220 

Gly Thr Gly Lys 
235 

His Gly Glu Leu 
250 

Val Leu Ala Pro 
265 

Ser Asp He Thr 



Pro Tyr Gly Gly 
300 

Val Gly Thr Pro 
315 

Asp Leu Thr Lys 
330 

Leu Asp Met Gly 
345 

Tyr Lys Lys Asp 



Thr Cys Pro His 
380 

Thr Tyr Glu Gin 
395 



+ 

Lys Ser Leu Arg 
95 

Ser Lys Thr Lys 
110 

He Asp Ala Pro 
125 

Gly Glu Thr Arg 



Pro Glu Pro Asp 
160 

Ser Glu He Glu 
175 

Ser Asn Phe Pro 
190 

Gly Val Thr Phe 
205 

Tyr Ser Gly Lys 



Thr Phe Ser Phe 
240 

Gin Asp Arg Lys 
255 

Thr Arg Glu Leu 
270 

Lys Lys Leu Ser 
285 

Gin Phe Glu Arg 



Gly Arg He Lys 
320 

Leu Lys His Val 
335 

Phe Ala Asp Gin 
350 

Ser Glu Asp Asn 
365 

Trp Val Phe Asn 



Val Asp Leu He 
400 
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Gly Lys Lys Thr 



Lys Cys His Trp 
420 

Val Tyr Ser Gly 
435 

Lys Glu Ala Gin 
450 

Gin Ser Leu His 
465 

Lys Gly Phe Arg 



Ala Ala Arg Gly 
500 

Ser Pro Pro Lys 
515 

Arg Gin Ser Trp 
530 

Gly Arg lie Ser 
545 



m 

Gin Lys Thr Ala 
405 

Thr Gin Arg Ala 



His Gin Gly Arg 
440 

Glu Leu Ser Gin 
455 

Gly Asp lie Pro 
470 

Asn Gly Ser Phe 
485 

Leu Asp lie Pro 



Gly Cys Arg Val 
520 

Lys Asp Gly Gly 
535 

Val Ser Thr Ser 
550 



He Thr Val Glu 
410 

Ala Val He Gly 
425 

Thr He He Phe 



Asn Ser Ala He 
460 

Gin Lys Gin Arg 
475 

Gly Val Leu Val 
490 

Glu Val Asp Leu 
505 

Leu His Ser Ser 



Val His Leu Leu 
540 

Gly Ala Lys Ser 
555 




His Leu Ala He 
415 

Asp Val He Arg 
430 

Cys Glu Thr Lys 
445 

Lys Gin Asp Ala 



Glu He Thr Leu 
480 

Ala Thr Asn Val 
495 

Val He Gin Ser 
510 

He Arg Ala Asp 
525 

Leu Ser Ala Gin 



Gly Asn 



<210> 23 

<211> 2005 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> ARK2, serine/ threonine kinase 15 (STK15) cDNA 



<400> 23 

cttgggtcct 

tgttaaggct 

ttgtcagaat 

ttcttcccag 

gaatcagaag 

gaataacacc 

ggaactggca 

ctttgaaatt 

aaagcaaagc 

cggagtggag 

tattcttaga 

tgcaccactt 

aactgctact 

tattcataga 

tgcagatttt 

cctggactac 

ctggagcctt 

cacataccaa 

aacagaggga 

aatgctcaga 

ttgccaaaac 



tgggtcgcag 
acagctccag 
ccattacctg 
cgcattcctt 
cagaagcaat 
caaaagagca 
tcaaaacaga 
ggtcgccctc 
aagtttattc 
catcagctca 
ctgtatggtt 
ggaacagttt 
tatataacag 
gacattaagc 
gggtggtcag 
ctgccccctg 
ggagttcttt 
gagacctaca 
gccagggacc 
gaagtacttg 
aaagaatcag 



gcatcatgga 
ttggaggtcc 
taaatagtgg 
tgcaagcaca 
tgcaggcaac 
agcagcccct 
aaaatgaaga 
tgggtaaagg 
tggctcttaa 
gaagagaagt 
atttccatga 
atagagaact 
aattggcaaa 
cagagaactt 
tacatgctcc 
aaatgattga 
gctatgaatt 
aaagaatatc 
tcatttcaag 
aacacccctg 
ctagcaaaca 



ccgatctaaa 
aaaacgtgtt 
ccaggctcag 
aaagcttgtc 
cagtgtacct 
gccatcggca 
atcaaaaaag 
aaagtttggt 
agtgttattt 
agaaatacag 
tgctaccaga 
tcagaaactt 
tgccctgtct 
acttcttgga 
atcttccagg 
aggtcggatg 
tttagttggg 
acgggttgaa 
actgttgaag 
gatcacagca 
gtcttaggaa 



gaaaactgca 
ctcgtgactc 
cgggtcttgt 
tccagtcaca 
catcctgtct 
cctgaaaata 
aggcagtggg 
aatgtttatt 
aaagctcagc 
tcccaccttc 
gtctacctaa 
tcaaagtttg 
tactgtcatt 
tcagctggag 
aggaccactc 
catgatgaga 
aagcctcctt 
ttcacattcc 
cataatccca 
aattcatcaa 
tcgtgcaggg 



tttcaggacc 
agcaatttcc 
gtccttcaaa 
agccggttca 
ccaggccact 
atcctgagga 
ctttggaaga 
tggcaagaga 
tggagaaagc 
ggcatcctaa 
ttctggaata 
atgagcagag 
cgaagagagt 
agcttaaaat 
tctgtggcac 
aggtggatct 
ttgaggcaaa 
ctgactttgt 
gccagaggcc 
aaccatcaaa 
ggagaaatcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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ttgagccagg 
gactgctgcc 
ccttaacctc 
agccacgaga 
ccgccccgtc 
tgtggggaaa 
tcctagtacc 
aatgagtatg 
ttctctggtg 
ttgggtttct 
acgtgctcta 
gctgttaagg 
atgcaaataa 



gctgccatat 
ctcaatctag 
cctattcaga 
attgtgctac 
agcctgtgct 
gtgaccactc 
tgagtgagtg 
tgattctttt 
gcattccttt 
agtcctcctt 
cctccattta 
gcttattttt 
ataagtatct 



aacctgacag 
aacgctacac 
aagctccaca 
ttatactggt 
aggcatggtg 
tgccctgacc 
tgtaacttat 
taagtatgaa 
aggaatgctg 
aaccacttat 
gggatttgct 
ttaaaacatt 
atgtc 



gaacatgcta 
aagaaatatt 
tcaataaaca 
tcataatctg 
tcttcacagg 
ccgatcagtt 
tgggttggcg 
aataaagata 
tgtgtctgtc 
ctcccatatg 
tgggatacag 
ggagtcatag 



ctgaagttta 
tgttttactc 
tgacactctg 
gaggcaaggt 
aggcaaatcc 
aaggagctgt 
aagcctggta 
tatgtacaga 
cggcaccccg 
agagtgtgaa 
aagaggccat 
catgtgtgta 



ttttaccatt 
agcaggtgtg 
aagtgaaagt 
tcgactgcag 
agagcctggc 
gcaataacct 
aagctgttgg 
cttgtatttt 
gtaggcctga 
aaataggaac 
gtgtctcaga 
aactttaaat 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2005 



<210> 24 
<211> 403 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> ARK2, serine/ threonine kinase 15 (STK15) 
<400> 24 

Met Asp Arg Ser Lys Glu Asn Cys lie Ser Gly Pro Val Lys Ala Thr 
1 5 10 15 

Ala Pro Val Gly Gly Pro Lys Arg Val Leu Val Thr Gin Gin Phe Pro 
20 25 30 

Cys Gin Asn Pro Leu Pro Val Asn Ser Gly Gin Ala Gin Arg Val Leu 
35 40 45 

Cys Pro Ser Asn Ser Ser Gin Arg lie Pro Leu Gin Ala Gin Lys Leu 
50 55 60 

Val Ser Ser His Lys Pro Val Gin Asn Gin Lys Gin Lys Gin Leu Gin 
65 70 75 80 

Ala Thr Ser Val Pro His Pro Val Ser Arg Pro Leu Asn Asn Thr Gin 
85 90 95 

Lys Ser Lys Gin Pro Leu Pro Ser Ala Pro Glu Asn Asn Pro Glu Glu 
100 105 110 

Glu Leu Ala Ser Lys Gin Lys Asn Glu Glu Ser Lys Lys Arg Gin Trp 
115 120 125 

Ala Leu Glu Asp Phe Glu lie Gly Arg Pro Leu Gly Lys Gly Lys Phe 
130 135 140 

Gly Asn Val Tyr Leu Ala Arg Glu Lys Gin Ser Lys Phe lie Leu Ala 
145 150 155 160 

Leu Lys Val Leu Phe Lys Ala Gin Leu Glu Lys Ala Gly Val Glu His 
165 170 175 

Gin Leu Arg Arg Glu Val Glu lie Gin Ser His Leu Arg His Pro Asn 
180 185 190 
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lie Leu Arg Leu Tyr Gly Tyr Phe His Asp Ala Thr Arg Val Tyr Leu 
195 200 205 

lie Leu Glu Tyr Ala Pro Leu Gly Thr Val Tyr Arg Glu Leu Gin Lys 
210 215 220 

Leu Ser Lys Phe Asp Glu Gin Arg Thr Ala Thr Tyr lie Thr Glu Leu 
225 230 235 240 

Ala Asn Ala Leu Ser Tyr Cys His Ser Lys Arg Val lie His Arg Asp 
245 250 255 

lie Lys Pro Glu Asn Leu Leu Leu Gly Ser Ala Gly Glu Leu Lys lie 
260 265 270 

Ala Asp Phe Gly Trp Ser Val His Ala Pro Ser Ser Arg Arg Thr Thr 
275 280 285 

Leu Cys Gly Thr Leu Asp Tyr Leu Pro Pro Glu Met lie Glu Gly Arg 
290 295 300 

Met His Asp Glu Lys Val Asp Leu Trp Ser Leu Gly Val Leu Cys Tyr 
305 310 315 320 

Glu Phe Leu Val Gly Lys Pro Pro Phe Glu Ala Asn Thr Tyr Gin Glu 
325 330 335 

Thr Tyr Lys Arg lie Ser Arg Val Glu Phe Thr Phe Pro Asp Phe Val 
340 345 350 

Thr Glu Gly Ala Arg Asp Leu lie Ser Arg Leu Leu Lys His Asn Pro 
355 360 365 

Ser Gin Arg Pro Met Leu Arg Glu Val Leu Glu His Pro Trp lie Thr 
370 375 380 

Ala Asn Ser Ser Lys Pro Ser Asn Cys Gin Asn Lys Glu Ser Ala Ser 
385 390 395 400 

Lys Gin Ser 



<210> 25 

<211> 1584 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> similar to transmembrane 4 superfamily member 1 
(TM4 SF1) CDNA 

<400> 25 

gtggtgtttg ctttctccac cagaagggca cactttcatc taatttgggg tatcactgag 60 

ctgaagacaa agagaagggg gagaaaacct agcagaccac catgtgctat gggaagtgtg 12 0 

cacgatgcat cggacattct ctggtggggc tcgccctcct gtgcatcgcg gctaatattt 180 

tgctttactt tcccaatggg gaaacaaagt atgcctccga aaaccacctc agccgcttcg 240 

tgtggttctt ttctggcatc gtaggaggtg gcctgctgat gctcctgcca gcatttgtct 300 

tcattgggct ggaacaggat gactgctgtg gctgctgtgg ccatgaaaac tgtggcaaac 360 

gatgtgcgat gctttcttct gtattggctg ctctcattgg aattgcagga tctggctact 420 

gtgtcattgt ggcagccctt ggcttagcag aaggaccact atgtcttgat tccctcggcc 480 

agtggaacta cacctttgcc agcaccgagg gccagtacct tctggatacc tccacatggt 540 



39 



m 



ccgagtgcac 
tggctcttgg 
gaggcatatg 
ccaggacaga 
atttgtaaaa 
taaagactgg 
gtttatttgt 
ttacagactg 
aaataacatt 
agttgctttt 
cactgcttgt 
aactgccttg 
tgttccaatc 
gagattcaaa 
atcatgtgtt 
gaataaagca 
gaggaaatac 
aaaaaaaaaa 



tgaacccaag 
tggaattgaa 
tggcttttgc 
gccacaatct 
ctttgtatta 
catcttcaca 
ttttgttttt 
agtgacagta 
ccaatcacta 
tataagacca 
atgatgtttc 
tgttctgtga 
caaatgaatg 
tttttctaac 
taaaaaaaag 
gttgatcagc 
cctcaaaact 
aaaaaaaaaa 



cacattgtgg 
ttcatcttgt 
tgctctcacc 
tcctctattt 
gtgtaacata 
ggatgtcagt 
tttttaggaa 
ctcagtatat 
ttgtatatat 
agaaggagaa 
ccattcatac 
gaaacaaata 
catcacaact 
atatggaaag 
aaaggctacg 
atcattggaa 
aacttgttta 
aaaa 



aatggaatgt 
gtcttattca 
aacagcaata 
cattgtaatt 
ctccccacag 
gtttaaattt 
tgaggaaaca 
ctgagataaa 
gtgcatgtat 
aatccgacaa 
acctataaat 
tttacttaga 
tacaatgctg 
ccttttgtcc 
atgactgggc 
catggggacg 
caacaaaata 



atctctgttt 
agtaataaat 
tgactgctaa 
tatatatttc 
tctactttta 
agtaaacttc 
aaccaccctc 
ctctataatg 
tttttaaatt 
cctggaaaga 
ctctaacaag 
gtggaaggac 
ctcattgttg 
tccaaagatg 
aagaagaaag 
agtgacggca 
aagtattcac 



tctatcctct 
ggagtgcttg 
aagaaccaac 
acttgtattc 
caaacgcctg 
ttttttgttt 
tgggggtagt 
ttttggataa 
aaagatgtct 
tttttgtttt 
aggccctttg 
tgattgagaa 
tgagtactat 
agtactaggg 
atgggaaact 

ggaggaccac 

tacgaaaaaa 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1584 



<210> 26 
<211> 202 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> similar to transmembrane 4 superfamily member 1 
(TM4 SF1) 

<400> 26 

Met Cys Tyr Gly Lys Cys Ala Arg Cys lie Gly His Ser Leu Val Gly 
15 10 15 

Leu Ala Leu Leu Cys lie Ala Ala Asn lie Leu Leu Tyr Phe Pro Asn 
20 25 30 

Gly Glu Thr Lys Tyr Ala Ser Glu Asn His Leu Ser Arg Phe Val Trp 
35 40 45 

Phe Phe Ser Gly lie Val Gly Gly Gly Leu Leu Met Leu Leu Pro Ala 
50 55 60 

Phe Val Phe lie Gly Leu Glu Gin Asp Asp Cys Cys Gly Cys Cys Gly 
65 70 75 80 

His Glu Asn Cys Gly Lys Arg Cys Ala Met Leu Ser Ser Val Leu Ala 
85 90 95 

Ala Leu He Gly He Ala Gly Ser Gly Tyr Cys Val He Val Ala Ala 
100 105 110 

Leu Gly Leu Ala Glu Gly Pro Leu Cys Leu Asp Ser Leu Gly Gin Trp 
115 120 125 

Asn Tyr Thr Phe Ala Ser Thr Glu Gly Gin Tyr Leu Leu Asp Thr Ser 
130 135 140 

Thr Trp Ser Glu Cys Thr Glu Pro Lys His He Val Glu Trp Asn Val 
145 150 155 160 



40 



Ser Leu Phe Ser lie Leu Leu Ala Leu Gly Gly lie Glu Phe lie Leu 
165 170 175 

Cys Leu lie Gin Val lie Asn Gly Val Leu Gly Gly lie Cys Gly Phe 
180 185 190 

Cys Cys Ser His Gin Gin Gin Tyr Asp Cys 
195 200 



<210> 27 

<211> 1025 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> excision repair cross -complementing rodent repair 
deficiency, complementation group 1 (ERCC1) cDNA 



<400> 27 

ccaagaccag 

agatggaccc 

agaaatttgt 

tccgatctac 

acgccgaata 

cagagcccct 

gcatcattgt 

cctgggaatt 

tcctcagcct 

tggggaagaa 

aggccctcaa 

ggagccccga 

cggacctcct 

ccaccgtgaa 

tggaacagct 

agaaagcccg 

cccagctgcc 

ctggc 



caggtgaggc 
tgggaaggac 
gatacccctc 
acagagcctt 
tgccatctca 
ggcaggagag 
gagccctcgg 
tggcgacgta 
ccgctaccac 
cttcgccttg 
ggagctggct 
ggaagctggg 
gatggagaag 
gtcagtcaac 
catcgccgca 
gaggctgttt 
aaggaaaccc 



ctcgcggcgc 
aaagaggggg 
gacgaggatg 
cccactgtgg 
cagcctctgg 
acgcccaacc 
cagaggggca 
attcccgact 
aacctgcacc 
cgggtcctgc 
aagatgtgta 
cggtacctgg 
ctagagcagg 
aaaacggaca 
tcaagagaag 
gatgtcctgc 
ccagtgtaat 



tgaaaccgtg 
tgccccagcc 
aggtccctcc 
acacctcggc 
aaggggctgg 
aggccctgaa 
atcccgtact 
atgtgctggg 
cagactacat 
ttgtccaggt 
tcctggccga 
agacctacaa 
acttcgtctc 
gtcagaccct 
atctggcctt 
acgagccctt 
aataaatcgt 



aggcccggac 
ctcagggccg 
tggagtggcc 
ccaggcggcc 
ggccacgtgc 
acccggggca 
gaagttcgtg 
ccagagcacc 
ccatgggcgg 
ggatgtgaaa 
ctgcacattg 
ggcctatgag 
ccgggtgact 
cctgaccaca 
atgcccaggc 
cttgaaagta 
cctcccaggc 



cacaggctcc 
ccagcaagga 
aagcccttat 
cctcagacct 
cccacagggt 
aaatccaaca 
cgcaatgtgc 
tgtgccctgt 
ctgcagagcc 
gatccccagc 
atcctcgcct 
cagaaaccag 
gaatgtctga 
tttggatctc 
ctgggccctc 
ccctgatgac 
caggctcctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1025 



<210> 28 

<211> 297 

<212> PRT 

<213> Homo sapiens 
<220> 

<223> excision repair cross -complementing rodent repair 
deficiency, complementation group 1 (ERCC1) 



<400> 28 

Met Asp Pro Gly Lys Asp Lys Glu Gly Val Pro Gin Pro Ser Gly Pro 
1 5 10 15 

Pro Ala Arg Lys Lys Phe Val lie Pro Leu Asp Glu Asp Glu Val Pro 
20 25 30 

Pro Gly Val Ala Lys Pro Leu Phe Arg Ser Thr Gin Ser Leu Pro Thr 
35 40 45 

Val Asp Thr Ser Ala Gin Ala Ala Pro Gin Thr Tyr Ala Glu Tyr Ala 
50 55 60 



41 



lie Ser Gin Pro Leu Glu Gly Ala Gly Ala Thr Cys Pro Thr Gly Ser 
65 70 75 80 

Glu Pro Leu Ala Gly Glu Thr Pro Asn Gin Ala Leu Lys Pro Gly Ala 
85 90 95 

Lys Ser Asn Ser lie He Val Ser Pro Arg Gin Arg Gly Asn Pro Val 
100 105 110 

Leu Lys Phe Val Arg Asn Val Pro Trp Glu Phe Gly Asp Val He Pro 
115 120 125 

Asp Tyr Val Leu Gly Gin Ser Thr Cys Ala Leu Phe Leu Ser Leu Arg 
130 135 140 

Tyr His Asn Leu His Pro Asp Tyr He His Gly Arg Leu Gin Ser Leu 
145 150 155 160 

Gly Lys Asn Phe Ala Leu Arg Val Leu Leu Val Gin Val Asp Val Lys 
165 170 175 

Asp Pro Gin Gin Ala Leu Lys Glu Leu Ala Lys Met Cys He Leu Ala 
180 185 190 

Asp Cys Thr Leu He Leu Ala Trp Ser Pro Glu Glu Ala Gly Arg Tyr 
195 200 205 

Leu Glu Thr Tyr Lys Ala Tyr Glu Gin Lys Pro Ala Asp Leu Leu Met 
210 215 220 

Glu Lys Leu Glu Gin Asp Phe Val Ser Arg Val Thr Glu Cys Leu Thr 
225 230 235 240 

Thr Val Lys Ser Val Asn Lys Thr Asp Ser Gin Thr Leu Leu Thr Thr 
245 250 255 

Phe Gly Ser Leu Glu Gin Leu He Ala Ala Ser Arg Glu Asp Leu Ala 
260 265 270 

Leu Cys Pro Gly Leu Gly Pro Gin Lys Ala Arg Arg Leu Phe Asp Val 
275 280 285 

Leu His Glu Pro Phe Leu Lys Val Pro 
290 295 



<210> 29 
<211> 181 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> cell tracker assay functional hit from retroviral 
screen FANCA partial cDNA 

<400> 29 

ccagtgtgct ggaaaggagg aagatatcct ggctggcact ctttcagttg acagagagtg 60 

acctcaggct ggggcggctc ctcctccgtg tggccccgga tcagcacacc aggctgctgc 120 

ctttcgcttt ttacagtctt ctctcctact tccatgaaga cgcggctttc cagcacagtg 180 
g 181 
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<210> 30 

<211> 603 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> cell tracker assay functional hit from retroviral 
screen DDX9 partial cDNA 



<400> 30 

ccagtgtgct 

cccctaccac 

tagtctcctc 

acacctgcac 

tagcctccac 

tatatccgct 

gtgtactgcc 

tgttcagcat 

cttcaacaac 

gagatatttg 

tgg 



ggaaagcgcc 
cwcctcgctg 
cagagttgcc 
catatcctgc 
tgctatagcc 
tccattgtcg 
aatcataagg 
acgttcattt 
caaagcctcc 
cagtttaatc 



acctcctctt 
gaatccccca 
tctaaagcca 
ccgaaaggag 
accgcatagc 
tatcgggcca 
ttgataccag 
acggggtcca 
atggctgccc 
cagtcatcta 



ccctgtccaa 
gatcctctgt 
cctcgggaga 
ttggcgctgc 
ctccaccact 
tcttgggagg 
cagctgaggg 
actggctgat 
ggagaccagt 
caagcacaat 



agtagccagt 
agcctccact 
ctcctctata 
caccatagcc 
gtaactagaa 
acgtggacca 
tctagagatc 
gatagcaggt 
gatacaggca 
ctgcccactt 



tccataggcc 
aggccctctg 
gcctccacca 
tccgctacca 
cctccccttc 
tctccatgcc 
tgacggatca 
tgtttggtta 
gcagcttcat 
tccagcacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

603 



<210> 31 

<211> 145 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> cell tracker assay functional hit from retroviral 
screen IGF1R partial cDNA 

<400> 31 

ccagtgtgtt ggaaagggag agaactgtca tttctaacct tcggcctttc acattgtacc 60 
gcatcgatat ccacagctgc aaccacgagg ctgagaagct gggctgcagc gcctccaact 120 
tcgtctttgc tttccagcac agtgg 145 



<210> 32 
<211> 269 
<212> DNA 
<213> Homo sapiens 

<220> 

<223> cell tracker assay functional hit from retroviral 
screen UBEV2V1 partial cDNA 

<400> 32 

ccagtgtgct ggaaaggtgc ttctgggtat ttaggtccac attctatttt aaggctgtat 
attcggtttt cataaattgt tcttggaggc ccaattatca tccctgtcca tcttgtaaga 120 
tgtcatgtct tcgtcatctt ctagacccca gctaactgtg ccatctccta ctcctttctg 180 
gccttcttcg agattcctcc aacagtcgga aattgcgagg gactttatac atcccgagcc 240 
cgtggtggct gccctttcca gcacactgg 269 



60 



<210> 33 

<211> 499 

<212> DNA 

<213> Homo sapiens 



43 



* 



m 



<220> 

<223> cell tracker assay functional hit from retroviral 
screen aldehyde dehydrogenase partial cDNA 



<400> 33 

ccagtgtgct 

agaggcaggg 

ggcagccatt 

tggcaagttg 

tggaggaaag 

tgcacaccat 

tgtggaagaa 

tatatccttg 

gactttccag 



ggaaaggagc 
tttcctcctg 
tcttctcaca 
atcaaagaag 
agcccttgca 

ggggtattct 

tcaatttatg 
gaaacatcct 
acacagtgg 



aaactcctct 
gagtagtgaa 
tggatataga 
ctgccgggaa 
ttgtgttagc 
accaccaggg 
atgagttttg 
ctgaccccag 



cactgctctc 
tattgttcct 
caaagtagcc 
aagcaatctg 
tgatgccgac 
ccagtgttgt 
ttcgaaggag 
gagtcactca 



cacgtggcat 
ggttatgggc 
ttcacaggat 
aagagggtga 
ttggacaatg 
atagccgcat 
tgttgagcgg 
aaggccctca 



ctttaataaa 60 
ctacagcagg 12 0 
caacagaggt 18 0 
ccctggagct 240 
ctgttgaatt 300 
ccaggatttt 360 
gctaagaacg 42 0 
gattgacaag 480 
499 



<210> 34 
<211> 425 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> cell tracker assay functional hit from retroviral 
screen pyruvate dehydrogenase partial cDNA 



<400> 34 

ccagtgtgct 

acaggatgtt 

ctcccttctt 

caggtccttt 

agctttccgt 

gagagaagtt 

ctggatcggg 

agtgg 



ggaaaggctg 
ctcgtcacac 
cagctccacc 
agtgtctaga 
ggctgtgcgc 
cagacgagcc 
aagctgggcc 



cccacttcca 
ttttccatgt 
tctgcagtgc 
gccacagcaa 
acattcttga 
acattcattc 
aatggtacag 



ccaccttgca 
aggcgttatc 
cgctgccctt 
cgggccggta 
tggtctccgc 
cagacttaat 
atgatgccag 



gatgttcttg 
cagcgtgatt 
gatgagccca 
gaggatgggg 
atggtactca 
catctccttc 
tgttccgggc 



tagtccagcc 60 
ttgagagtgg 12 0 
gttcggatct 180 
tcagaagcaa 240 
tgagttccat 300 
aacgtctcca 360 
tttccagcac 420 
425 



<210> 35 
<211> 238 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> cell tracker assay functional hit from retroviral 
screen G6PD partial cDNA 

<220> 

<221> modif ied_base 

<222> (1) . . (238) 

<223> n = g, a, c or t 

<400> 35 

ccagtgtgct ggaaactttc cagttctcca tggccaccan acacagcatc tgcagtaggt 60 
ggttctgcat cacgtcccgg atgatcccaa attcatcgaa atagcccccg cgaccctcag 120 
tgccaaaggg ctccttgaag gtgaggataa cgcaggcgat gttgtcccgg ttccanatgg 180 
ggccgaagat cctgttggca aatctcagca ccatgaggtt ctctttccag cacagtgg 23 8 



<210> 36 

<211> 491 

<212> DNA 

<213> Artificial Sequence 



44 



<220> 

<223> Description of Artificial Sequence : G3 - 2D8 sequence 
with C- terminus of GFP, partial BAP-1 sequence 



<220> 

<221> modif ied_base 
<222> (69) 

<223> n = g, a, c or t 



<400> 36 

gagttcgtga 

gccaaggcng 

tctgcacgtc 

tgggcggact 

gggggacttg 

ggggttgggg 

tgggagggct 

gtggttgccc 

cagcacagtg 



ccgccgccgg 
gtggcagcgg 
atcctctcgt 
ggaactcggc 
gcataattgt 
tgaaccccat 
gtgggatggg 
tcagaggctg 
9 



gatcactctc 
tggctccagt 
catcctcata 
tgcggcccac 
gattgtctag 
tgaggctgct 
gcttgtgcgc 
caggggccct 



ggcatggacg 
gtgctggaaa 
gtcatcctca 
acctgccgcc 
aaaggccggc 
gcctggaggc 
atgaaccagc 
gtttgcttcc 



agctgtacaa 
gctaagggca 
tcatctgagt 
aggtcttctt 
agccgctgga 
taccactagc 
cgcctcctcg 
agcaccagcg 



ggaggaggcc 60 
gagttggtgt 12 0 
actgctgggg 180 
cctcctgcat 240 
caatgggagt 3 00 
ttgggtttgt 360 
caccatctgt 420 
gggacctttc 480 
491 



<210> 37 

<211> 34 

<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence : G3 -2D8 sequence 
with C- terminus of GFP, partial BAP-1 sequence 



<400> 37 

Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr 
15 10 15 

Lys Glu Glu Ala Ala Lys Ala Gly Gly Ser Gly Gly Ser Ser Val Leu 
20 25 30 



Glu Ser 



<210> 38 
<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : BstXI linker 
<400> 38 

ccagtgtgct ggaaag 16 



<210> 39 

<211> 16 

<212> DNA 

<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence : BstXI linker 
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<400> 39 

ctttccagca cagtgg 16 



<210> 40 
<211> 136 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Gl-2635 sequence, partial NP95 sequence 
<400> 40 

Ala Glu Gin Ser Cys Asp Gin Lys Leu Thr Asn Thr Asn Arg Ala Leu 
15 10 15 

Ala Leu Asn Cys Phe Ala Pro lie Asn Asp Gin Glu Gly Ala Glu Ala 
20 25 30 

Lys Asp Trp Arg Ser Gly Lys Pro Val Arg Val Val Arg Asn Val Lys 
35 40 45 

Gly Gly Lys Asn Ser Lys Tyr Ala Pro Ala Glu Gly Asn Arg Tyr Asp 
50 55 60 

Gly lie Tyr Lys Val Val Lys Tyr Trp Pro Glu Lys Gly Lys Ser Gly 
65 70 75 80 

Phe Leu Val Trp Arg Tyr Leu Leu Arg Arg Asp Asp Asp Glu Pro Gly 
85 90 95 

Pro Trp Thr Lys Glu Gly Lys Asp Arg lie Lys Lys Leu Gly Leu Thr 
100 105 110 

Met Gin Tyr Pro Glu Gly Tyr Leu Glu Ala Leu Ala Asn Arg Glu Arg 
115 120 125 

Glu Lys Glu Asn Ser Lys Arg Glu 
130 135 



<210> 41 

<211> 49 

<212> PRT 

<213> Homo sapiens 

<220> 

<223> G2-2F3 sequence, partial FANCA sequence 
<400> 41 

Arg Lys lie Ser Trp Leu Ala Leu Phe Gin Leu Thr Glu Ser Asp Leu 
15 10 15 

Arg Leu Gly Arg Leu Leu Leu Arg Val Ala Pro Asp Gin His Thr Arg 
20 25 30 

Leu Leu Pro Phe Ala Phe Tyr Ser Leu Leu Ser Tyr Phe His Glu Asp 
35 40 45 

Ala 
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<210> 42 

<211> 552 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : G3 -2H6 sequence 
with C-terminus of GFP, partial DDX9 sequence 



<400> 42 

gagttcgtga 

gccaaggccg 

tccaaagtag 

tctgtagcct 

ggagactcct 

gctgccacca 

ccactgtaac 

ggaggacgtg 

gagggtctac 

ctgatgatag 



ccgccgccgg 
gtggcagcgg 
ccagttccat 
ccactaggcc 
ctatagcctc 
tagcctccgc 
tagaacctcc 
gaccatctcc 
agatctgacg 
ca 



gatcactctc 
tggctccagt 
aggcccccct 
ctctgtagtc 
caccaacacc 
taccatagcc 
ccttctatat 
atgccgtgta 
gatcatgttc 



ggcatggacg 
gtgctggaaa 
accacctcct 
tcctccagag 
tgcaccatat 
tccactgcta 
ccgcttccat 
ctgccaatca 
agcatacgtt 



agctgtacaa 
gcgccacctc 
cgctggaatc 
ttgcctctaa 
cctgcccgaa 
tagccaccgc 
tgtcgtatcg 
taaggttgat 
catttacggg 



ggaggaggcc 
ctcttccctg 
ccccagatcc 
agccacctcg 
aggagttggc 
atagcctcca 
ggccatcttg 
accagcagct 
gtccaactgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

552 



<210> 43 
<211> 117 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : G3 -2H6 sequence 
with C-terminus of GFP, partial DDX9 sequence 

<400> 43 

Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr 
15 10 15 

Lys Glu Glu Ala Ala Lys Ala Gly Gly Ser Gly Gly Ser Ser Val Leu 
20 25 30 

Glu Ser Ala Thr Ser Ser Ser Leu Ser Lys Val Ala Ser Ser lie Gly 
35 40 45 

Pro Pro Thr Thr Ser Ser Leu Glu Ser Pro Arg Ser Ser Val Ala Ser 
50 55 60 

Thr Arg Pro Ser Val Val Ser Ser Arg Val Ala Ser Lys Ala Thr Ser 
65 70 75 80 

Gly Asp Ser Ser lie Ala Ser Thr Asn Thr Cys Thr lie Ser Cys Pro 
85 90 95 

Lys Gly Val Gly Ala Ala Thr lie Ala Ser Ala Thr lie Ala Ser Thr 
100 105 110 

Ala lie Ala Thr Ala 
115 



<210> 44 

<211> 38 

<212> PRT 

<213> Homo sapiens 



47 



<220> 

<223> G3-2H2_1 sequence, partial IGF1R sequence 
<400> 44 

Glu Arg Thr Val lie Ser Asn Leu Arg Pro Phe Thr Leu Tyr Arg lie 
15 10 15 

Asp lie His Ser Cys Asn His Glu Ala Glu Lys Leu Gly Cys Ser Ala 
20 25 30 

Ser Asn Phe Val Phe Ala 
35 




<210> 45 
<211> 347 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :G3 -2G2/2H2 

sequence with C-terminus of GFP, partial UBE2V1 
sequence 

<400> 45 

gagttcgtga ccgccgccgg gatcactctc ggcatggacg agctgtacaa ggaggaggcc 60 

gccaaggccg gtggcagcgg tggctccagt gtgctggaaa ggtgcttctg ggtatttagg 12 0 

tccacattct attttaaggc tgtatattcg gttttcataa attgttcttg gaggcccaat 180 

tatcatccct gtccatcttg taagtgtcat gtcttcgtca tcttctagac cccagctaac 240 

tgtgccatct cctactcctt tctggccttc ttcgagttct tccaacagtc ggaaattgcg 300 

agggactttt actcccgagc ccgtggtggc tctttccagc acagtgg 347 



<210> 46 
<211> 75 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : G3 -2G2/2H2 

sequence with C-terminus of GFP , partial UBE2V1 
sequence 

<400> 46 

Glu Phe Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr 
15 10 15 

Lys Glu Glu Ala Ala Lys Ala Gly Gly Ser Gly Gly Ser Ser Val Leu 
20 25 30 

Glu Arg Cys Phe Trp Val Phe Arg Ser Thr Phe Tyr Phe Lys Ala Val 
35 40 45 

Tyr Ser Val Phe lie Asn Cys Ser Trp Arg Pro Asn Tyr His Pro Cys 
50 55 60 

Pro Ser Cys Lys Cys His Val Phe Val lie Phe 
65 70 75 



48 




<210> 47 

<211> 177 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of G3_2H2 sequence 
<400> 47 

ggcagccacc acgggctcgg gagtaaaagt ccctcgcaat ttccgactgt tggaagaact 60 
cgaagaaggc cagaaaggag taggagatgg cacagttagc tggggtctag aagatgacga 120 
agacatgaca cttacaagat ggacagggat gataattggg cctccaagaa caattta 177 



<210> 48 

<211> 198 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of UbcE2vl alternatively spliced 
transcript 

<400> 48 

ggcacgaggg cgacgcaaga tggcagccac cacgggctcg ggagtaaaag tccctcgcaa 60 
tttccgactg ttggaagaac tcgaagaagg ccagaaagga gtaggagatg gcacagttag 120 
ctggggtcta gaagatgacg aagacatgac acttacaaga tggacaggga tgataattgg 180 
gcctccaaga acaattta 198 



<210> 49 

<211> 112 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of UbcE2vl_2 alternatively spliced 
transcript 

<400> 49 

gtcatcctgc cacgtaaaca ccatcgcatc caccacgtct caccccacga gacctacttc 60 
tgcatcacca caggagtaaa agtccctcgc aatttccgac tgttggaaga ac 112 



<210> 50 

<211> 113 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of UbcE2vl_l alternatively spliced 
transcript 

<400> 50 

aagcgtctta cctgaagtca caaagcaaac tgagtgatga aggaagactt gaacctagaa 60 
aatttcactg caaagggagt aaaagtccct cgcaatttcg actgttggaa gaa 113 



49 




<210> 51 
<211> 192 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> portion of UbcE2vlBs alternatively spliced 
transcript 

<400> 51 

atggcctaca agttccgcac agtgttacag ccctggagta aaagtccctc gcaatttccg 60 
actgttggaa gaactcgaag aaggccagaa aggagtagga gatggcacag ttagctgggg 12 0 
tctagaagat gacgaagaca tgacacttac aagatggaca gggatgataa ttgggcctcc 180 
aagaacaatt ta 192 



<210> 52 

<211> 194 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of UbcE2v2 alternatively spliced 
transcript 

<400> 52 

cgcgtcgggc tgcaggagaa gatggcggtc tccacaggag ttaaagttcc tcgtaatttt 60 
cgcttgttgg aagaacttga agaaggacaa aaaggagtag gcgacggtac agttagctgg 12 0 
ggccttgaag atgatgaaga tatgacactt acaaggtgga caggcatgat tattgggcca 180 
ccaaggacaa atta 194 



<210> 53 
<211> 113 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> portion of UbcE2vl_2 and UbcE2vl_l alternatively spliced 
transcripts 

<400> 53 

gaaggccaga aaggagtagg agatggcaca gttagctggg gtctagaaga tgacgaagac 60 
atgacactta caagatggac agggatgata attgggcctc caagaacaat tta 113 



<210> 54 

<211> 49 

<212> DNA 

<213> Homo sapiens 

<220> 

<223> portion of G3_2H2 sequence 
<400> 54 

cgaatataca gccttaaaat agaatgtgga cctaaatacc cagaagcac 49 



50 



m 



<210> 55 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> portion of UbcE2vl / UbcE2vl_2 , UbcE2vl_l and UbcE2vlBs 
alternatively spliced transcripts 

<400> 55 

cgaatataca gccttaaaat agaatgtgga cctaaatacc cagaagcacc cccctttgta 60 



<210> 56 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<220> 

<223> portion of UbcE2v2 alternatively spliced 
transcript 

<400> 56 

agaatatata gcctgaaagt agaatgtgga cctaaatacc cagaagctcc tccgtcagtt 60 



<210> 57 
<211> 225 
<212> PRT 

<213> Homo sapiens 
<220> 

<223> Uch-13 sequence 
<400> 57 

Met Glu Gly Gin Arg Trp Leu Pro Leu Glu Ala Asn Pro Glu Val Thr 
15 10 15 

Asn Gin Phe Leu Lys Gin Leu Gly Leu His Pro Asn Trp Gin Phe Val 
20 25 30 

Asp Val Tyr Gly Met Asp Pro Glu Leu Leu Ser Met Val Pro Arg Pro 
35 40 45 

Val Cys Ala Val Leu Leu Leu Phe Pro lie Thr Glu Lys Tyr Glu Val 
50 55 60 

Phe Arg Thr Glu Glu Glu Glu Lys lie Lys Ser Gin Gly Gin Asp Val 
65 70 75 80 

Thr Ser Ser Val Tyr Phe Met Lys Gin Thr lie Ser Asn Ala Cys Gly 
85 90 95 

Thr lie Gly Leu lie His Ala lie Ala Asn Asn Lys Asp Lys Met His 
100 105 110 

Phe Glu Ser Gly Ser Thr Leu Lys Lys Phe Leu Glu Glu Ser Val Ser 
115 120 125 

Met Ser Pro Glu Glu Arg Ala Arg Tyr Leu Glu Asn Tyr Asp Ala lie 
130 135 140 
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Arg Val Thr His Glu Thr Ser Ala His Glu Gly Gin Thr Glu Ala Pro 
145 150 155 160 

Ser lie Asp Glu Lys Val Asp Leu His Phe lie Ala Leu Val His Val 
165 170 175 

Asp Gly His Leu Tyr Glu Leu Asp Gly Arg Lys Pro Phe Pro lie Asn 
180 185 190 

His Gly Glu Thr Ser Asp Glu Thr Leu Leu Glu Asp Ala lie Glu Val 
195 200 205 

Cys Lys Lys Phe Met Glu Arg Asp Pro Asp Glu Leu Arg Phe Asn Ala 
210 215 220 

He 
225 



<210> 58 
<211> 231 
<212> PRT 
< 2 1 3 > Homo i 


sapiens 
























<220> 

<223> portion of BAP-1 






















<400> 58 
Met Asn Lys 

1- 


Gly 


Trp 
5 


Leu 


Glu 


Leu 


Glu 


Ser 
10 


Asp 


Pro 


Gly 


Leu 


Phe 
15 


Thr 


Leu Leu Val 


Glu 
20 


Asp 


Phe 


Gly Val 


Lys 
25 


Gly Val 


Gin 


Val 


Glu 
30 


Glu 


He 


Tyr Asp Leu 
35 


Gin 


Ser 


Lys 


Cys 


Gin Gly 
40 


Pro 


Val 


Tyr Gly 
45 


Fne 


± ±e 


rile 


Leu Phe Lys 
50 


Trp 


He 


Glu 


Glu 
55 


Arg 


Arg 


Ser 


Arg 


Arg 
60 


Lys 


Val 


Ser 


Thr 


Leu Val Asp 
65 


Asp 


Thr 


Ser 
70 


Val 


He 


Asp 


Asp 


Asp 
75 


He 


Val 


Asn 


Asn 


Met 
80 


Phe Phe Ala 


His 


Gin 
85 


Leu 


He 


Pro 


Asn 


Ser 
90 


Cys 


Ala 


Thr 


His 


Ala 
95 


Leu 


Leu Ser Val 


Leu 
100 


Leu 


Asn 


Cys 


Ser 


Ser 
105 


Val 


Asp 


Leu 


Gly 


Pro 
110 


Thr 


Leu 


Ser Arg Met 
115 


Lys 


Asp 


Phe 


Thr 


Lys 
120 


Gly 


Phe 


Ser 


Pro 


Glu 
125 


Ser 


Lys 


Gly 


Tyr Ala He 
13 0 


Gly 


Asn 


Ala 


Pro 
135 


Glu 


Leu 


Ala 


Lys 


Ala 
140 


His 


Asn 


Ser 


His 


Ala Arg Pro 
145 


Glu 


Pro 


Arg 
150 


His 


Leu 


Pro 


Glu 


Lys 
155 


Gin 


Asn 


Gly 


Leu 


Ser 
160 


Ala Val Arg 


Thr 


Met 
165 


Glu 


Ala 


Phe 


His 


Phe 
170 


Val 


Ser 


Tyr 


Val 


Pro 
175 


He 
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Thr Gly Arg Leu Phe Glu 
180 

His Gly Pro Trp Gly Glu 
195 

Val He Met Glu Arg He 
210 

Asp He Arg Phe Asn Leu 
225 230 



Leu Asp Gly Leu Lys 
185 

Asp Glu Glu Trp Thr 
200 

Gly Leu Ala Thr Ala 
215 

Met 



Val Tyr Pro He Asp 
190 

Asp Lys Ala Arg Arg 
205 

Gly Glu Pro Tyr His 
220 



<210> 59 
<211> 200 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : poly Gly 
flexible linker 

<220> 

<221> M0D_RES 
<222> (6) . . (200) 

<223> Gly residues from position 6 to 200 may be present 
or absent 

<400> 59 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
15 10 15 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
20 25 30 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
50 55 60 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
65 70 75 80 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
85 90 95 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
100 105 110 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
115 120 125 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
130 135 140 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
145 150 155 160 

Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
165 170 175 
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Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly Gly 
180 185 190 

Gly Gly Gly Gly Gly Gly Gly Gly 
195 200 



<210> 60 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : DEAD box 
conserved motif 

<400> 60 
Asp Glu Ala Asp 
1 
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